Robust Acoustic Speech Feature Prediction From Noisy Mel-Frequency Cepstral Coefficients

Ben Milner, Jonathan Darch

doi:10.1109/tasl.2010.2047811

Abstract

This paper examines the effect of applying noise compensation to acoustic speech feature prediction from noisy mel-frequency cepstral coefficient (MFCC) vectors within a distributed speech recognition architecture. An acoustic speech feature (comprising fundamental frequency, formant frequencies, speech/nonspeech classification, and voicing classification) is predicted from an MFCC vector in a maximum a posteriori (MAP) framework using phoneme-specific or global models of speech. The effect of noise is considered, and three different noise compensation methods that have been successful in robust speech recognition are integrated within the MAP framework. Experiments show that noise compensation can be applied successfully to prediction, with the best performance given by a model adaptation method that performs only slightly worse than matched training and testing. Further experiments consider application of the predicted acoustic features to speech reconstruction. A series of human listening tests shows that the predicted features are sufficient for speech reconstruction and that noise compensation improves speech quality in noisy conditions.
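
As a rough illustration of what MAP prediction of an acoustic feature from an MFCC vector can look like, the sketch below assumes a single joint Gaussian over the stacked vector [MFCC, feature]. That model choice is an assumption made here for illustration; the abstract only says that phoneme-specific or global models of speech are used. Under a joint Gaussian, the MAP estimate of the feature given the MFCC vector is the conditional mean, mu_f + S_fx S_xx^{-1} (x - mu_x). The function name `map_predict`, the dimensions, and the synthetic data are all hypothetical, and no noise compensation is included.

```python
import numpy as np

def map_predict(x, mean, cov, dim_x):
    """MAP estimate of f given x under a joint Gaussian over z = [x, f].

    For a Gaussian the posterior mode equals the conditional mean:
        f_hat = mu_f + cov_fx @ inv(cov_xx) @ (x - mu_x)
    """
    mu_x, mu_f = mean[:dim_x], mean[dim_x:]
    cov_xx = cov[:dim_x, :dim_x]
    cov_fx = cov[dim_x:, :dim_x]
    # Solve cov_xx @ w = (x - mu_x) instead of forming an explicit inverse.
    w = np.linalg.solve(cov_xx, x - mu_x)
    return mu_f + cov_fx @ w

# Synthetic stand-ins (assumption): 3-dim "MFCC" vectors and a 2-dim
# "acoustic feature" that depends linearly on them plus a little noise.
rng = np.random.default_rng(0)
dim_x, dim_f = 3, 2
A = rng.normal(size=(dim_f, dim_x))
x_train = rng.normal(size=(5000, dim_x))
f_train = x_train @ A.T + 0.01 * rng.normal(size=(5000, dim_f))

# Estimate the joint Gaussian from the stacked training vectors.
z = np.hstack([x_train, f_train])
mean = z.mean(axis=0)
cov = np.cov(z, rowvar=False)

x_new = np.array([0.5, -1.0, 0.25])
f_hat = map_predict(x_new, mean, cov, dim_x)
```

With a mixture of Gaussians or phoneme-specific models, the same conditional-mean step would be applied per component or per phoneme model and then combined or selected; that extension is not shown here.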

Journal: IEEE Transactions on Audio, Speech, and Language Processing
Publication Date: Feb 1, 2011
Citations: 19

Similar Papers

Analysis and prediction of acoustic speech features from mel-frequency cepstral coefficients in distributed speech recognition architectures
Jonathan Darch ... Saeed Vaseghi
The Journal of the Acoustical Society of America | VOL. 124
01 Dec 2008

Applying noise compensation methods to robustly predict acoustic speech features from MFCC vectors in noise
Ben Milner ... Jonathan Darch
01 Mar 2008

HMM-based MAP prediction of voiced and unvoiced formant frequencies from noisy MFCC vectors
Jonathan Darch ... Ben Milner
17 Sep 2006

MAP prediction of formant frequencies and voicing class from MFCC vectors in noise
Jonathan Darch ... Saeed Vaseghi
Speech Communication | VOL. 48
07 Jul 2006
