Abstract

In the traditional model of speech recognition, acoustic and linguistic information sources are assumed to be independent of each other. Parameters of the hidden Markov model (HMM) and the n-gram model are estimated separately for maximum a posteriori classification. However, speech features and lexical words are inherently correlated in natural language, and the lack of coupling between the two models leads to inefficiencies. This paper reports on joint acoustic and linguistic modeling for speech recognition, in which acoustic evidence is used to estimate the linguistic model parameters, and vice versa, according to the maximum entropy (ME) principle. Discriminative ME (DME) models are exploited using features from competing sentences. Moreover, a mutual ME (MME) model is built for the sentence posterior probability, which is maximized to estimate the model parameters by characterizing the dependence between acoustic and linguistic features. An N-best Viterbi approximation is presented for implementing the DME and MME models. Additionally, the new models incorporate high-order feature statistics and word regularities. In the experiments, the proposed methods increase the sentence posterior probability or the model separation. Compared with separately estimated HMM and n-gram models, recognition errors are significantly reduced: from 32.2% to 27.4% on the MATBN corpus and from 5.4% to 4.8% on the WSJ corpus (5K condition).
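For context, a maximum entropy model of the sentence posterior typically takes the log-linear form below. This is a generic sketch of the standard ME formulation, with assumed feature functions f_i, weights lambda_i, and N-best set N(X) as illustrative notation; it is not the paper's exact parameterization.

P(W \mid X) \;=\; \frac{\exp\!\big(\sum_i \lambda_i\, f_i(X, W)\big)}{\sum_{W' \in \mathcal{N}(X)} \exp\!\big(\sum_i \lambda_i\, f_i(X, W')\big)}

Here X denotes the acoustic observation sequence, W a candidate word sequence, and f_i(X, W) joint acoustic-linguistic feature functions (for instance, HMM log-likelihoods and n-gram log-probabilities). The normalization sum is restricted to an N-best list N(X) of competing sentences from the decoder, in the spirit of the N-best Viterbi approximation mentioned in the abstract.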
