GEDI: Gammachirp envelope distortion index for predicting intelligibility of enhanced speech

Katsuhiko Yamamoto, Toshio Irino, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani

doi:10.1016/j.specom.2020.06.001

Open Access

https://doi.org/10.1016/j.specom.2020.06.001

Abstract

In this study, we propose a new concept, the gammachirp envelope distortion index (GEDI), which is based on the signal-to-distortion ratio in the auditory envelope (SDRenv), to predict the intelligibility of speech enhanced by nonlinear algorithms. GEDI calculates the distortion between enhanced-speech and clean-speech representations in the domain of the temporal envelope extracted by a gammachirp auditory filterbank and a modulation filterbank. We also extend GEDI with multi-resolution analysis (mr-GEDI) to predict the intelligibility of speech under non-stationary noise conditions. We evaluate GEDI on speech enhanced by classic spectral subtraction and by a Wiener filtering method, and compare its predictions with human results at various signal-to-noise ratios with additive pink and babble noise. The results show that mr-GEDI predicted the intelligibility curves better than the short-time objective intelligibility (STOI) measure, the extended STOI (ESTOI) measure, and the hearing-aid speech perception index (HASPI) under pink-noise conditions, and better than HASPI under babble-noise conditions. The mr-GEDI method shows no tendency toward overestimation and is considered a more conservative approach than STOI and ESTOI. Evaluation with mr-GEDI may therefore provide additional information in the development of speech enhancement algorithms.
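To make the idea of an envelope-domain signal-to-distortion ratio concrete, the following is a minimal sketch and not the authors' implementation. It assumes that SDRenv can be illustrated as the power of the clean-speech envelope divided by the power of the difference between the clean and enhanced envelopes, expressed in dB. The gammachirp auditory filterbank and the modulation filterbank are replaced by a crude stand-in (full-wave rectification plus a moving average), and the names `envelope`, `sdr_env_db`, and `win` are hypothetical.

```python
import math


def envelope(signal, win):
    """Crude temporal envelope: full-wave rectification + causal moving average.

    Assumption: this stands in for the gammachirp auditory filterbank and
    modulation filterbank stages, which are not reproduced here.
    """
    rectified = [abs(x) for x in signal]
    out, acc = [], 0.0
    for i, value in enumerate(rectified):
        acc += value
        if i >= win:
            acc -= rectified[i - win]  # drop the sample that left the window
        out.append(acc / min(i + 1, win))
    return out


def sdr_env_db(clean, enhanced, win=64):
    """Envelope-domain signal-to-distortion ratio in dB (illustrative definition)."""
    env_clean = envelope(clean, win)
    env_enh = envelope(enhanced, win)
    p_signal = sum(c * c for c in env_clean)
    p_distortion = sum((c - e) ** 2 for c, e in zip(env_clean, env_enh))
    if p_distortion == 0.0:
        return math.inf  # identical envelopes: no distortion
    return 10.0 * math.log10(p_signal / p_distortion)


# Toy signals (assumed for illustration): a 500 Hz tone with 4 Hz amplitude modulation.
fs, n = 8000, 4000
clean = [
    (1.0 + 0.8 * math.sin(2 * math.pi * 4 * i / fs)) * math.sin(2 * math.pi * 500 * i / fs)
    for i in range(n)
]
mild = [0.9 * x for x in clean]                      # slight attenuation
clipped = [max(-0.5, min(0.5, x)) for x in clean]    # hard clipping, a nonlinear distortion

print("mild attenuation:", sdr_env_db(clean, mild))
print("hard clipping:   ", sdr_env_db(clean, clipped))
```

In this toy setup the clipped signal scores lower than the mildly attenuated one, because clipping flattens the slow amplitude modulation that the envelope carries. The sketch performs no level normalisation, so even a pure gain change counts as distortion here.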

Journal: Speech Communication
Publication Date: Jun 17, 2020
Citations: 29

Similar Papers

Performance analysis of neural network, NMF and statistical approaches for speech enhancement
Ravi Kumar Kandagatla ... Venkata Subbaiah Potluri
International Journal of Speech Technology | VOL. 23
17 Sep 2020

Speech intelligibility improvement in noisy environments based on energy correlation in frequency bands
Peyman Goli ... Mohammad Reza Karami-Mollaei
Digital Signal Processing | VOL. 62
15 Dec 2016

Japanese speech intelligibility estimation and prediction using objective intelligibility indices under noisy and reverberant conditions
Yosuke Kobayashi ... Kazuhiro Kondo
Applied Acoustics | VOL. 156
31 Jul 2019

Modulation Wiener filter for improving speech intelligibility
Chung-Chien Hsu ... Tai-Shih Chi
01 Apr 2015
