Robust state clustering using phonetic decision trees

Chaojun Liu,Yonghong Yan

doi:10.1016/j.specom.2003.12.003

Abstract

The widely used acoustic modeling approach of phonetic decision-tree based context clustering does not take full advantage of limited training data, and therefore fails to produce robust acoustic models. Two problems are identified: (1) all states clustered in a leaf node must share the same set of Gaussian components and mixture weights; no distinction is provided among those states; (2) rarely seen triphones in the training data might be poorly estimated and cause an adverse effect on decision-tree clustering. We propose a number of approaches to address these problems by more efficient use of training data. Specifically, (1) a two-level decision-tree approach for the first problem that ties Gaussian components and mixture weights separately, as they require different amounts of data to obtain robust estimation of their parameters; and (2) a two-stage decision-tree based clustering approach and a MAP-based approach for the second problem. Each approach gives a statistical significant reduction of the word error rate (WER) over the traditional approach. The systems combining all new approaches achieve the best performance, which reduce the WERs of the baseline systems by 14–17% and reduce the model sizes by 8–11% on the WSJ tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust state clustering using phonetic decision trees

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Dec 29, 2003
Citations: 29

Similar Papers

Building Acoustic and Language Model for Continuous Speech Recognition in Bahasa Indonesia
Andreas Widjaja ... Vincent Elbert Budiman
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6
Andreas Widjaja, et. al.Andreas Widjaja ... Vincent Elbert Budiman
10 Aug 2020
Jurnal Teknik Informatika dan Sistem Informasi | VOL. 6

Ensemble acoustic modeling in automatic speech recognition
Xin Chen
-
Xin ChenXin Chen
01 Jan 2010
01 Jan 2010

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Shrikanth S Narayanan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Kartik Audhkhasi, et. al.Kartik Audhkhasi ... Shrikanth S Narayanan
01 Mar 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition
Jian Xue ... Yunxin Zhao
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16
Jian Xue, et. al. Jian Xue ... Yunxin Zhao
01 Mar 2008
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust state clustering using phonetic decision trees

Abstract

Talk to us

Similar Papers

More From: Speech Communication