Mandarin pronunciation modeling based on CASS corpus

Fang Zheng,Byrne William,Zhanjiang Song,Pascale Fung

doi:10.1007/bf02947304

Abstract

The pronunciation variability is an important issue that must be faced with when developing practical automatic spontaneous speech recognition systems. In this paper, the factors that may affect the recognition performance are analyzed, including those specific to the Chinese language. By studying the INITIAL/FINAL (IF) characteristics of Chinese language and developing the Bayesian equation, the concepts of generalized INITIAL/FINAL (GIF) and generalized syllable (GS), the GIF modeling and the IF-GIF modeling, as well as the context-dependent pronunciation weighting, are proposed based on a well phonetically transcribed seed database. By using these methods, the Chinese syllable error rate (SER) is reduced by 6.3% and 4.2% compared with the GIF modeling and IF modeling respectively when the language model, such as syllable or word N-gram, is not used. The effectiveness of these methods is also proved when more data without the phonetic transcription are used to refine the acoustic model using the proposed iterative forced-alignment based transcribing (IFABT) method, achieving a 5.7% SER reduction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mandarin pronunciation modeling based on CASS corpus

Abstract

Talk to us

Similar Papers

More From: Journal of Computer Science and Technology

Lead the way for us

Journal: Journal of Computer Science and Technology	Publication Date: May 1, 2002
Citations: 32

Similar Papers

Modeling pronunciation variation using context-dependent weighting and b/s refined acoustic modeling
Fang Zheng ... Zhanjiang Song
-
Fang Zheng, et. al.Fang Zheng ... Zhanjiang Song
03 Sep 2001
03 Sep 2001

A Myanmar large vocabulary continuous speech recognition system
Hay Mar Soe Naing ... Xinhui Hu
-
Hay Mar Soe Naing, et. al.Hay Mar Soe Naing ... Xinhui Hu
01 Dec 2015
01 Dec 2015

State-dependent phoneme-based model merging for dialectal Chinese speech recognition
Linquan Liu ... Wenhu Wu
Speech Communication | VOL. 50
Linquan Liu, et. al.Linquan Liu ... Wenhu Wu
07 May 2008
Speech Communication | VOL. 50

Automatic initial/final generation for dialectal Chinese speech recognition
Linquan Liu ... Wenhu Wu
-
Linquan Liu, et. al.Linquan Liu ... Wenhu Wu
17 Sep 2006
17 Sep 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mandarin pronunciation modeling based on CASS corpus

Abstract

Talk to us

Similar Papers

More From: Journal of Computer Science and Technology