Joint beamforming and spectral enhancement for robust automatic speech recognition in reverberant environments

Fanuel Melak Asmare, Stefan Goetze, Mathias Bode, Bernard Mayer, Feifei Xiong

doi:10.1121/1.4950678

Abstract

This work evaluates multi-microphone beamforming techniques and single-microphone spectral enhancement strategies for alleviating the effect of reverberation on robust automatic speech recognition (ASR) systems in reverberant environments characterized by different reverberation times (T60) and direct-to-reverberation ratios (DRRs). The systems under test consist of minimum variance distortionless response (MVDR) beamformers in combination with minimum mean square error (MMSE) estimators. For the latter, reliable late reverberation spectral variance (LRSV) estimation employing a generalized model of the room impulse response (RIR) is crucial. Based on the generalized RIR model, which separates the direct path from the remaining RIR, two different frequency resolutions in the short-time Fourier transform (STFT) domain, referred to as short- and long-term, are evaluated to effectively estimate the direct signal. Regarding the fusion of the MVDR beamformer and the MMSE estimator, the LRSV estimator can operate either on the multi-channel observed speech signals or on the single-channel beamformer output. Accordingly, four different combination system architectures are evaluated and analyzed with a focus on optimal ASR performance with respect to word error rate (WER).
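
As a rough illustration of the MVDR beamformer named in the abstract, the sketch below computes the textbook MVDR weights for a single frequency bin, w = R^{-1} d / (d^H R^{-1} d), where R is the noise (or interference) covariance matrix and d is the steering vector. This is not the authors' implementation: the function names, the two-microphone steering vector, and the covariance values are hypothetical and chosen only to make the example runnable with the Python standard library.

```python
import cmath


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting (complex-valued)."""
    n = len(b)
    # Work on an augmented copy so the inputs are not modified.
    M = [list(A[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("covariance matrix is singular")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    # Back substitution.
    x = [0j] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = s / M[i][i]
    return x


def mvdr_weights(R, d):
    """Textbook MVDR weights for one frequency bin: R^{-1} d / (d^H R^{-1} d)."""
    Rinv_d = solve_linear(R, d)
    denom = sum(di.conjugate() * ri for di, ri in zip(d, Rinv_d))
    return [ri / denom for ri in Rinv_d]


def beamform(w, x):
    """Beamformer output y = w^H x for one time-frequency bin."""
    return sum(wi.conjugate() * xi for wi, xi in zip(w, x))


# Hypothetical two-microphone setup (values are illustrative assumptions).
d = [1 + 0j, cmath.exp(-1j * 0.5)]            # steering vector
R = [[1.0 + 0j, 0.3 + 0.1j],
     [0.3 - 0.1j, 1.2 + 0j]]                  # Hermitian noise covariance
w = mvdr_weights(R, d)

# Distortionless constraint: a signal arriving along d passes with unit gain.
response = beamform(w, d)
```

With a Hermitian R, `response` equals 1 up to rounding, which is the "distortionless" part of MVDR. When R is the identity matrix, the weights reduce to d divided by the number of microphones, i.e. a delay-and-sum beamformer.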

More From: The Journal of the Acoustical Society of America

Similar Papers

A study on joint beamforming and spectral enhancement for robust speech recognition in reverberant environments
Feifei Xiong ... Bernd T Meyer
01 Apr 2015

Performance Comparison of DS/SS Code Acquisition using MMSE and MVDR Beamforming in Jamming
Henri Puska ... Jari Iinatti
01 Oct 2007

On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition
Xiong Xiao ... Eng Siong Chng
01 Mar 2017

Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction
Shengkui Zhao ... Thi Ngoc Tho Nguyen
01 Dec 2015