A wavelet-based thresholding approach to reconstructing unreliable spectrogram components

Shirin Badiezadegan, Richard C Rose

doi:10.1016/j.specom.2014.11.005

Abstract

Data imputation approaches for robust automatic speech recognition reconstruct noise-corrupted spectral information by exploiting prior knowledge of the relationship between target speech and background through the use of spectrographic masks. Most of these approaches are model-based techniques that can provide accurate estimates of the underlying clean speech only when the characteristics of the noise-corrupted features do not deviate from those of the model. Discrete wavelet transform (DWT) based de-noising methods can also be used for re-estimating the underlying clean speech from a noise-corrupted signal, but they often require that the background noise be stationary and modeled by a Gaussian distribution. A novel approach is presented here for incorporating the information derived from spectrographic masks into a DWT-based de-noising method. The spectrographic masks are used to derive thresholds for de-noising wavelet-domain coefficients, making DWT-based de-noising more suitable for non-stationary noise conditions. The results of an experimental study are presented to demonstrate the performance of DWT-based data imputation relative to other established techniques on the Aurora 2 noisy speech recognition task. It is shown that the proposed approach reduces the impact of the model mismatch associated with parametric approaches and exploits the robustness of the non-parametric wavelet de-noising approach.
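
The idea of deriving wavelet thresholds from a spectrographic mask can be illustrated with a small sketch. It is not the authors' algorithm: the one-level Haar transform, the soft mask with values in [0, 1], the linear threshold rule, and the names `mask_guided_denoise` and `max_threshold` are all assumptions made for illustration only. The sketch shrinks the detail coefficients of one spectral frame more strongly where the mask marks the underlying channels as unreliable.

```python
import math


def haar_dwt(x):
    """One-level Haar DWT of an even-length sequence: (approx, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt."""
    s = math.sqrt(2.0)
    x = []
    for a, d in zip(approx, detail):
        x.append((a + d) / s)
        x.append((a - d) / s)
    return x


def soft_threshold(c, t):
    """Shrink coefficient c toward zero by t (standard soft thresholding)."""
    return math.copysign(max(abs(c) - t, 0.0), c)


def mask_guided_denoise(frame, mask, max_threshold):
    """De-noise one frame of spectral values across frequency channels.

    mask[i] is a soft reliability score in [0, 1] (1 = reliable channel).
    Assumed rule (hypothetical, not from the paper): each detail coefficient
    gets a threshold proportional to the mean unreliability of the two
    channels it spans, so reliable regions are left untouched.
    """
    approx, detail = haar_dwt(frame)
    shrunk = []
    for k, d in enumerate(detail):
        unreliability = 1.0 - 0.5 * (mask[2 * k] + mask[2 * k + 1])
        shrunk.append(soft_threshold(d, max_threshold * unreliability))
    return haar_idwt(approx, shrunk)
```

With a mask of all ones the frame passes through unchanged, while a mask of all zeros and a large `max_threshold` replaces each pair of channels by its mean. Because the threshold varies per coefficient rather than being fixed globally, the shrinkage can follow a noise level that changes over time and frequency, which is the property the abstract attributes to mask-derived thresholds.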

Journal: Speech Communication
Publication Date: Dec 8, 2014
Citations: 10

Similar Papers

A wavelet-based data imputation approach to spectrogram reconstruction for robust speech recognition
Shirin Badiezadegan ... Richard C Rose
01 May 2011

Deep Learning for Minimum Mean-Square Error and Missing Data Approaches to Robust Speech Processing
04 Dec 2020

Perceptual consequences of different signal changes due to binaural noise reduction: do hearing loss and working memory capacity play a role?
Tobias Neher ... Giso Grimm
Ear & Hearing | VOL. 35
01 Sep 2014

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
20 May 2008
