Abstract

Noise compensation techniques for robust automatic speech recognition (ASR) attempt to improve system performance in the presence of acoustic interference. In feature-based noise compensation, which includes speech enhancement approaches, the acoustic features that are sent to the recognizer are first processed to remove the effects of noise (see Chapter 9). Model compensation approaches, in contrast, are concerned with modifying and even extending the acoustic model of speech to account for the effects of noise. A taxonomy of the different approaches to noise compensation is depicted in Figure 12.1, which serves as a road map for the present discussion. The two main strategies used for model compensation approaches are model adaptation and model-based noise compensation. Model adaptation approaches implicitly account for noise by adjusting the parameters of the acoustic model of speech, whereas model-based noise compensation approaches explicitly model the noise and its effect on the noisy speech features. Common adaptation approaches include maximum likelihood linear regression (MLLR) [56], maximum a posteriori (MAP) adaptation [32], and their generalizations [17, 29, 47]. These approaches, which are discussed in Chapter 11, alter the speech acoustic model in a completely data-driven way given additional training data or test data. Adaptation methods are somewhat more general than model-based approaches in that they may handle effects on the signal that are difficult to explicitly model, such as nonlinear distortion and changes in the voice in reaction to noise (the Lombard effect [53]). However, in the presence of additive noise, failing to take into account the known interactions between speech and noise can be detrimental to performance.
Model-based noise compensation approaches, in contrast to adaptation approaches, explicitly model the different factors present in the acoustic environment: the speech, the various sources of acoustic interference, and how they interact to form the noisy speech.
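As a rough illustration of the kind of explicit interaction model meant here (the notation below is a common convention, not the chapter's own): if speech and noise are assumed additive and uncorrelated in the power-spectral domain, then in the log-spectral (or log-Mel) domain the noisy observation $y$, clean speech $x$, and noise $n$ are related approximately by the well-known mismatch function

```latex
\exp(y) \approx \exp(x) + \exp(n)
\quad\Longrightarrow\quad
y \approx x + \log\bigl(1 + \exp(n - x)\bigr)
```

This nonlinear relationship is what model-based methods (e.g., vector Taylor series compensation) linearize or otherwise approximate in order to transform a clean-speech acoustic model into a model of noisy speech.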
