Abstract

Summary form only given. Maximum entropy (maxent) models have become very popular in natural language processing. We begin with a basic introduction to the maximum entropy principle, cover the popular algorithms for training maxent models, and describe how maxent models have been used in language modeling and (more recently) acoustic modeling for speech recognition. Some comparisons with other discriminative modeling methods are made. A substantial amount of time is devoted to the details of a new framework for acoustic modeling using maximum entropy direct models, including practical issues of implementation and usage. Traditional statistical models for speech recognition have all been based on a Bayesian framework using generative models such as hidden Markov models (HMMs). The new framework is based on maximum entropy direct modeling, in which the probability of a state or word sequence given an observation sequence is computed directly from the model. In contrast to HMMs, features can be asynchronous and overlapping, and need not be statistically independent. This model therefore allows for the potential combination of many different types of features. Results from a specific kind of direct model, the maximum entropy Markov model (MEMM), are presented. Even with conventional acoustic features, the approach already shows promising results for phone-level decoding. The MEMM significantly outperforms a traditional HMM in word error rate when used as a stand-alone acoustic model. Combining the MEMM scores with HMM and language model scores shows modest improvements over the best HMM speech recognizer. We give a sense of some exciting possibilities for future research in using maximum entropy models for acoustic modeling.
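To make the direct-modeling idea concrete, the following is a minimal, hypothetical sketch of a MEMM-style local classifier: the conditional probability of the next state given the previous state and the current observation is computed directly as a maximum-entropy (log-linear) distribution over overlapping indicator features, trained by gradient ascent on the conditional log-likelihood. The states, feature templates, and toy data are illustrative assumptions, not taken from the paper.

```python
import math

# Hypothetical two-state example; real acoustic models would use phone states
# and acoustic feature vectors instead of symbolic observations.
STATES = ["A", "B"]

def features(state, prev_state, obs):
    """Binary indicator features. They overlap (both fire for the same state)
    and need not be statistically independent -- unlike HMM emission features."""
    return [
        f"state={state}|obs={obs}",
        f"state={state}|prev={prev_state}",
    ]

def probs(weights, prev_state, obs):
    """P(state | prev_state, obs) ∝ exp(sum of weights of active features)."""
    scores = {s: sum(weights.get(f, 0.0) for f in features(s, prev_state, obs))
              for s in STATES}
    m = max(scores.values())                      # stabilize the softmax
    exps = {s: math.exp(v - m) for s, v in scores.items()}
    z = sum(exps.values())
    return {s: e / z for s, e in exps.items()}

def train(data, epochs=200, lr=0.5):
    """Gradient ascent on conditional log-likelihood: for each training event,
    the gradient for feature f is f(observed state) - E_model[f]."""
    weights = {}
    for _ in range(epochs):
        for state, prev_state, obs in data:
            p = probs(weights, prev_state, obs)
            for f in features(state, prev_state, obs):   # empirical count
                weights[f] = weights.get(f, 0.0) + lr
            for s in STATES:                              # expected count
                for f in features(s, prev_state, obs):
                    weights[f] = weights.get(f, 0.0) - lr * p[s]
    return weights

# Toy data: observation "x" tends to lead to state A, "y" to state B.
data = [("A", "A", "x"), ("A", "B", "x"), ("B", "A", "y"), ("B", "B", "y")]
w = train(data)
p = probs(w, "A", "x")
```

Decoding a full state sequence would chain these local distributions with Viterbi search, exactly as with an HMM, but with the conditional probabilities computed directly rather than via Bayes' rule over a generative model.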
