Maximum-likelihood stochastic-transformation adaptation of hidden Markov models

V.D. Diakoloukas, V.V. Digalakis

doi:10.1109/89.748122

Abstract

The recognition accuracy of large-vocabulary automatic speech recognition (ASR) systems depends strongly on the mismatch between the training and testing sets. For example, dialect differences between the training and testing speakers significantly degrade recognition performance. Some popular adaptation approaches improve the performance of speech recognizers based on hidden Markov models with continuous mixture densities by using linear transformations to adapt the means, and possibly the covariances, of the mixture Gaussians. The linear assumption, however, is too restrictive, and in this paper we propose a novel adaptation technique that adapts the means and, optionally, the covariances of the mixture Gaussians by using multiple stochastic transformations. We perform both speaker and dialect adaptation experiments and show that our method significantly improves the recognition accuracy and robustness of our system. The experiments are carried out with SRI's DECIPHER speech recognition system.
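
To make the contrast in the abstract concrete, here is a minimal one-dimensional sketch of the two ideas: a single affine transformation applied to every mixture Gaussian, and a set of affine transformations each applied with some probability. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: the names `Gaussian`, `adapt_linear`, and `adapt_stochastic` are hypothetical, the scalar case stands in for the vector case, and the transformation parameters are supplied by hand rather than estimated by maximum likelihood as the title suggests the paper does.

```python
import math
from dataclasses import dataclass


@dataclass
class Gaussian:
    """One component of a 1-D Gaussian mixture (illustrative, not from the paper)."""
    weight: float
    mean: float
    var: float


def normal_pdf(x, mean, var):
    """Density of a 1-D normal distribution with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def mixture_pdf(x, components):
    """Density of the mixture at x: weighted sum of component densities."""
    return sum(g.weight * normal_pdf(x, g.mean, g.var) for g in components)


def adapt_linear(components, a, b):
    """Apply one affine map to every component: mean -> a*mean + b, var -> a^2 * var."""
    return [Gaussian(g.weight, a * g.mean + b, a * a * g.var) for g in components]


def adapt_stochastic(components, transforms):
    """Apply several affine maps, each chosen with its own probability.

    transforms is a list of (prob, a, b) tuples whose probabilities sum to 1.
    Assumption: each original Gaussian becomes a small mixture with one
    transformed copy per map, weighted by that map's probability.
    """
    total = sum(p for p, _, _ in transforms)
    if abs(total - 1.0) > 1e-9:
        raise ValueError("transformation probabilities must sum to 1")
    return [
        Gaussian(g.weight * p, a * g.mean + b, a * a * g.var)
        for g in components
        for (p, a, b) in transforms
    ]


if __name__ == "__main__":
    # Hypothetical speaker-independent mixture and hand-picked transformations.
    speaker_independent = [Gaussian(0.5, 0.0, 1.0), Gaussian(0.5, 2.0, 0.5)]
    linear = adapt_linear(speaker_independent, a=2.0, b=1.0)
    stochastic = adapt_stochastic(
        speaker_independent, [(0.25, 1.0, 0.0), (0.75, 2.0, 1.0)]
    )
    print(len(linear), len(stochastic))
    print(mixture_pdf(1.0, linear), mixture_pdf(1.0, stochastic))
```

Under these assumptions, a single transformation with probability 1 reduces the stochastic case to the linear one, while several transformations let each adapted density take a shape that no single affine map of the original Gaussian could produce.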

Journal: IEEE Transactions on Speech and Audio Processing
Publication Date: Mar 1, 1999
Citations: 59

Similar Papers

Development of HMM Based Automatic Speech Recognition System for Indian English
Anushri Garud ... Arti Bang
01 Aug 2018

HMM Adaptation Using Statistical Linear Approximation for Robust Speech Recognition
Berkovitch Michael ... Shallom D.Il
23 Jun 2011

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
20 May 2008

Multifactor adaptation for Mandarin broadcast news and conversation speech recognition
Wen Wang ... Jing Zheng
06 Sep 2009
