An MTF-based blind restoration of temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments

Xugang Lu,Masato Akagi,Masashi Unoki

doi:10.1121/1.2933278

Abstract

To reduce speech degradation in reverberant environments, we previously proposed a modulation transfer function (MTF) based method for speech dereverberation. It is based on the MTF relation that the sub-band temporal power envelope of reverberant speech can be represented as the convolution between temporal power envelopes of clean speech and the room impulse response. Therefore, the sub-band power envelope of clean speech can be estimated using inverse MTF filtering without measuring the room impulse response. We tested the effectiveness of this method as a front-end for automatic speech recognition (ASR) in both artificial and real reverberant environments. Reverberant speech signals were created by simple convolution of clean speech (AURORA-2J) and artificially-produced or real room impulse responses. The relative spectral filtering of the auditory-power-spectrum based method was used as a baseline. Compared with the baseline, our proposed method had 36.64% and 21.68% improvements in error reduction rate for artificial reverberant environments (reverberation times from 0.2 to 2.0 s) and real reverberant environments (43 reverberant impulse responses), respectively. These results indicate that our proposed method can be used as a robust front-end for ASR. [Work supported by a Grant-in-Aid for Science Research from the Japanese Ministry of Education (No. 18680017).]

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An MTF-based blind restoration of temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: May 1, 2008
Citations: 14

Similar Papers

Comparative evaluation of modulation-transfer-function-based blind restoration of sub-band power envelopes of speech as a front-end processor for automatic speech recognition systems
Xugang Lu ... Masato Akagi
Acoustical Science and Technology | VOL. 29
Xugang Lu, et. al.Xugang Lu ... Masato Akagi
01 Jan 2008
Acoustical Science and Technology | VOL. 29

MTF-based method of blind estimation of reverberation time in room acoustics
...
-
, et. al. ...
25 Aug 2008
25 Aug 2008

Toward blind reverberation time estimation for non-speech signals
João F Santos ... Nils Peters
The Journal of the Acoustical Society of America | VOL. 133
João F Santos, et. al.João F Santos ... Nils Peters
01 May 2013
The Journal of the Acoustical Society of America | VOL. 133

Towards blind reverberation time estimation for non-speech signals
Joao F Santos ... Tiago H Falk
-
Joao F Santos, et. al.Joao F Santos ... Tiago H Falk
01 Jan 2013
01 Jan 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An MTF-based blind restoration of temporal power envelopes as a front-end processor for automatic speech recognition systems in reverberant environments

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America