Generation of robust phonetic set and decision tree for Mandarin using chi-square testing

Yeou-Jiunn Chen,Chung-Hsien Wu,Yu-Hsien Chiu,Hsiang-Chuan Liao

doi:10.1016/s0167-6393(01)00076-0

Abstract

A phonetic representation of a language is used to describe the corresponding pronunciation and synthesize the acoustic model of any vocabulary. A phonetic representation with smaller phonetic units such as SAMPA-C for Mandarin Chinese and decision trees for parameter sharing are broadly applied to deal with the problem of large numbers of recognition units. However, the confusable phonetic representation in SAMPA-C generally degrades the recognition performance. In this paper, a statistical method based on chi-square testing is used to investigate the phonetic unit characteristics that are confusing and develop a more reliable phonetic set, named modified SAMPA-C. A corresponding question set for the modified SAMPA-C and a two-level splitting criterion are also proposed to effectively and efficiently construct the decision trees. Experiments using continuous Mandarin telephone speech recognition were conducted. Experimental results show that an encouraging improvement in recognition performance can be obtained. The proposed approaches represent a good compromise between the demands of accurate acoustic modeling and the limitations imposed by insufficient training data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generation of robust phonetic set and decision tree for Mandarin using chi-square testing

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Nov 5, 2001
Citations: 28

Similar Papers

Robust decision tree state tying for continuous speech recognition
W Reichl ... Wu Chou
IEEE Transactions on Speech and Audio Processing | VOL. 8
W Reichl, et. al.W Reichl ... Wu Chou
01 Jan 1999
IEEE Transactions on Speech and Audio Processing | VOL. 8

Learning from examples: generation and evaluation of decision trees for software resource analysis
R.W Selby ... A.A Porter
IEEE Transactions on Software Engineering | VOL. 14
R.W Selby, et. al.R.W Selby ... A.A Porter
01 Jan 1987
IEEE Transactions on Software Engineering | VOL. 14

Integrating different acoustic and syntactic language models in a continuous speech recognition system
Amparo Varona ... In Torres
-
Amparo Varona, et. al.Amparo Varona ... In Torres
16 Oct 2000
16 Oct 2000

Integrate template matching and statistical modeling for continuous speech recognition
Xie Sun
-
Xie SunXie Sun
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generation of robust phonetic set and decision tree for Mandarin using chi-square testing

Abstract

Talk to us

Similar Papers

More From: Speech Communication