Comparative Evaluation of Single‐Channel MMSE‐Based Noise Reduction Schemes for Speech Recognition

Emanuele Principi,Simone Cifani,Stefano Squartini,Rudy Rotili,Francesco Piazza

doi:10.1155/2010/962103

Emanuele Principi, Simone Cifani + Show 3 more

Open Access

PDF Available

https://doi.org/10.1155/2010/962103

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

One of the big challenges in the field of Automatic Speech Recognition (ASR) consists in developing suitable solutions able to work properly also in adverse acoustic conditions, like in presence of additive noise and/or in reverberant rooms. Recently a certain attention has been paid to deeply integrate the noise suppressor in the feature extraction pipeline. In this paper, different single‐channel MMSE‐based noise reduction schemes have been implemented both in the frequency and cepstral domains and the related recognition performances evaluated on the AURORA2 and AURORA4 databases, therefore providing a useful reference for the scientific community.

Highlights

Automatic Speech Recognition (ASR) is a challenging task largely addressed by the scientific community in the last two decades
The followed approach is similar to Ephraim and Malah algorithm (E&M) [4] but differs because the algorithm is applied to the power spectral magnitude of the filter bank’s output instead of the DFT spectral amplitude and because the noise variance takes into account the phase difference between the noise and the clean speech
Frequency domain results on AURORA2 show that LSA algorithm produces a remarkable improvement of recognition accuracy, and that the Global SNR (gSNR) modification gives a further increase of about 2% on average

Summary

Introduction

Automatic Speech Recognition (ASR) is a challenging task largely addressed by the scientific community in the last two decades. A notable interest raised during last years in the study and development of robust solutions in presence of acoustic nonidealities [1], for example, background noise, simultaneous speakers, and reverberation As result of these efforts, a profuse literature of environment-robust ASR techniques has been registered. The following classification can be proposed therein, as highlighted in [2]: featuredomain (FD) and model-based (MB) algorithms The latter class encompasses all methodologies aimed to adapt the acoustic model (HMM) parameters in order to maximize the system matching to the distorted environment. We can cite the log-spectral amplitude MMSE suppression rules due to their efficacy to reduce noise at a cost of low distortion level [4, 5] These rules have been implemented in the cepstral domain, so working closely to the backend [6, 7].

Background on Frequenc-Domain MMSE Algorithms

Gain Modification Based on Soft Decision

Background on Cepstral Domain MMSE Algorithms

Cepstral Domain Gain Modification Based on Soft Decision

Computer Simulations

Conclusions

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Electrical and Computer Engineering	Publication Date: Jan 1, 2010
Citations: 17	License type: CC BY 3.0

R Discovery Prime

Comparative Evaluation of Single‐Channel MMSE‐Based Noise Reduction Schemes for Speech Recognition

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Journal of Electrical and Computer Engineering

Lead the way for us

Similar Papers

A comparative study of mel cepstra and EIH for phone classification under adverse conditions
S Sandhu ... O Ghitza
-
S Sandhu, et. al.S Sandhu ... O Ghitza
09 May 1995
09 May 1995

Deep Variational Filter Learning Models for Speech Recognition
Purvi Agrawal ... Sriram Ganapathy
-
Purvi Agrawal, et. al.Purvi Agrawal ... Sriram Ganapathy
01 May 2019
01 May 2019

Auditory processing-based features for improving speech recognition in adverse acoustic conditions
Hari Krishna Maganti ... Marco Matassoni
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2014
Hari Krishna Maganti, et. al.Hari Krishna Maganti ... Marco Matassoni
06 May 2014
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2014

Comparison of visual features for audio‐visual speech recognition using the AURORA‐2J‐AV database
Takahiro Togo ... Takayuki Kitasaka
The Journal of the Acoustical Society of America | VOL. 120
Takahiro Togo, et. al.Takahiro Togo ... Takayuki Kitasaka
01 Nov 2006
The Journal of the Acoustical Society of America | VOL. 120

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Comparative Evaluation of Single‐Channel MMSE‐Based Noise Reduction Schemes for Speech Recognition

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Journal of Electrical and Computer Engineering