Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition

Jing Zheng, Nelson Morgan, Ozgur Cetin, Andreas Stolcke, Mei-Yuh Hwang, Xin Lei

doi:10.1109/icassp.2007.366992

Abstract

Recent developments in large vocabulary continuous speech recognition (LVCSR) have shown the effectiveness of discriminative training approaches, represented by three techniques: discriminative Gaussian training using the minimum phone error (MPE) criterion; discriminatively trained features estimated by multilayer perceptrons (MLPs); and discriminative feature transforms such as feature-level MPE (fMPE). Although MLP features, MPE models, and fMPE transforms have each been shown to improve recognition accuracy, no previous work has applied all three in a single LVCSR system. This paper uses a state-of-the-art Mandarin recognition system as a platform to study the interaction of all three techniques. Experiments in the broadcast news and broadcast conversation domains show that the contribution of each technique is nonredundant, and that the full combination yields the best performance and has good domain generalization.
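As a rough illustration of how the three techniques named in the abstract fit together at the frame level, the sketch below chains them in pure Python. It is not the authors' implementation: the function names, the toy dimensions, and all numeric values are hypothetical. It assumes the commonly described forms of each piece: MLP-derived features appended to the base cepstral vector, an fMPE-style offset y = x + M h (where h is a vector of Gaussian posteriors and M is the trained projection), and a diagonal-covariance Gaussian whose parameters would come from MPE training.

```python
import math

def tandem_features(cepstra, mlp_feats):
    """Append MLP-derived features to the base cepstral vector."""
    return list(cepstra) + list(mlp_feats)

def fmpe_transform(x, h, M):
    """fMPE-style offset: y = x + M h, with M of shape len(x) x len(h)."""
    if len(M) != len(x) or any(len(row) != len(h) for row in M):
        raise ValueError("M must have shape len(x) x len(h)")
    return [xi + sum(m * hj for m, hj in zip(row, h))
            for xi, row in zip(x, M)]

def diag_gaussian_loglik(y, mean, var):
    """Log-likelihood of y under a diagonal-covariance Gaussian.
    In a real system the mean and variance would be MPE-trained."""
    return sum(-0.5 * (math.log(2.0 * math.pi * v) + (yi - mu) ** 2 / v)
               for yi, mu, v in zip(y, mean, var))

# Toy frame: 2 cepstral coefficients plus 1 MLP-derived feature (hypothetical).
x = tandem_features([1.0, -0.5], [0.2])

# Hypothetical posterior vector h and projection M (3 x 2).
# An all-zero M leaves the features unchanged.
h = [0.5, 0.5]
M = [[0.0, 0.0], [0.0, 0.0], [0.0, 0.0]]
y = fmpe_transform(x, h, M)

# Score the transformed frame against one toy Gaussian.
score = diag_gaussian_loglik(y, mean=[0.0, 0.0, 0.0], var=[1.0, 1.0, 1.0])
```

The ordering (feature augmentation, then feature transform, then model scoring) is only meant to show why the three contributions can stack; the actual training procedures for the MLP, M, and the Gaussians are outside the scope of this sketch.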

