Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework

Haitian Xu Haitian Xu,B Lindberg,P Dalsgaard,Zheng-Hua Tan Zheng-Hua Tan

doi:10.1109/icassp.2006.1660227

Abstract

Compared to multi-condition training (MTR), condition-dependent training generates multiple acoustic hidden Markov model sets each identified by a noisy environment and is known to perform substantially better for known noise types (included in training) while worse for unknown (untrained) noise types. This paper attempts to bridge the performance gap between known and unknown noise types by introducing a Minimum Mean-Square Error (MMSE) noise-type based compensation algorithm. On the basis of a modified Vector Taylor Series and the measurement of feature reliability as well as noise similarity, the MMSE estimation adapts the test features corrupted by the unknown noise type to the corresponding features corrupted by the known noise type. This method significantly improves the recognition performance for unknown noise types while maintaining the good performance for known noise types. Furthermore, in order to benefit directly from MTR, a model interpolation strategy is investigated which combines the MTR and the condition-dependent model sets. Both good performance and low computational cost are achieved by only interpolating the mixtures of each condition-dependent model state with the least weighted mixture in the corresponding MTR model state. The overall system gives promising results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Robust Audio-visual Speech Recognition Using Bimodal Dfsmn with Multi-condition Training and Dropout Regularization
Shiliang Zhang ... Ming Lei
-
Shiliang Zhang, et. al.Shiliang Zhang ... Ming Lei
01 May 2019
01 May 2019

Effect of multi-condition training and speech enhancement methods on spoofing detection
Hong Yu ... Zhanyu Ma
-
Hong Yu, et. al.Hong Yu ... Zhanyu Ma
01 Jul 2016
01 Jul 2016

Robust speech recognition using MLP neural network in log-spectral domain
Masoumeh P Ghaemmaghami ... Saeed Dabbaghchian
-
Masoumeh P Ghaemmaghami, et. al.Masoumeh P Ghaemmaghami ... Saeed Dabbaghchian
01 Dec 2009
01 Dec 2009

Noise Condition-Dependent Training Based on Noise Classification and SNR Estimation
Haitian Xu ... Zheng-Hua Tan
IEEE Transactions on Audio, Speech and Language Processing | VOL. 15
Haitian Xu, et. al.Haitian Xu ... Zheng-Hua Tan
01 Nov 2007
IEEE Transactions on Audio, Speech and Language Processing | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust Speech Recognition From Noise-Type Based Feature Compensation and Model Interpolation in a Multiple Model Framework

Abstract

Talk to us

Similar Papers