A Dialectal Chinese Speech Recognition Framework

Jing Li, Thomas Fang Zheng, William Byrne, Dan Jurafsky

doi:10.1007/s11390-006-0106-9

Abstract

A framework for dialectal Chinese speech recognition is proposed and studied, in which a relatively small dialectal Chinese speech corpus (that is, Chinese spoken under the influence of the speaker's native dialect) and dialect-related knowledge are used to transform a standard Chinese (Putonghua, abbreviated PTH) speech recognizer into a dialectal Chinese speech recognizer. Two kinds of knowledge sources are explored: expert knowledge and a small dialectal Chinese corpus. These knowledge sources provide information at four levels: the phonetic level, the lexicon level, the language level, and the acoustic decoder level. This paper takes Wu dialectal Chinese (WDC) as an example target language. The goal is to build a WDC speech recognizer from an existing PTH speech recognizer, based on the Initial-Final structure of the Chinese language and a study of how dialectal Chinese speakers speak Putonghua. The authors propose to use context-independent PTH-IF mappings (where IF means either a Chinese Initial or a Chinese Final), context-independent WDC-IF mappings, and syllable-dependent WDC-IF mappings (obtained from either experts or data), and to combine them with supervised maximum likelihood linear regression (MLLR) acoustic model adaptation. The IF mappings introduce a multi-pronunciation lexicon, which can increase lexical confusion and thereby degrade performance. To reduce the size of this lexicon, a Multi-Pronunciation Expansion (MPE) method based on the accumulated uni-gram probability (AUP) is proposed. In addition, some commonly used WDC words are selected and added to the lexicon. Compared with the original PTH speech recognizer, the resulting WDC speech recognizer achieves a 10–18% absolute Character Error Rate (CER) reduction when recognizing WDC, with only a 0.62% CER increase when recognizing PTH. The proposed framework and methods are expected to work not only for Wu dialectal Chinese but also for other dialectal Chinese languages and even for other languages.
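The results above are reported as absolute CER changes. As a minimal sketch of what that means, the Python below computes CER as character-level edit distance divided by reference length, then contrasts absolute and relative reduction. The function names and the example strings are assumptions made for illustration; they are not taken from the paper, and the paper's own scoring tool may differ in details such as text normalization.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two character sequences."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting i reference characters to reach an empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character Error Rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference must not be empty")
    return edit_distance(ref, hyp) / len(ref)


def absolute_reduction(baseline: float, adapted: float) -> float:
    """Difference between the two error rates, in the same units."""
    return baseline - adapted


def relative_reduction(baseline: float, adapted: float) -> float:
    """Reduction expressed as a fraction of the baseline error rate."""
    return (baseline - adapted) / baseline


# Hypothetical transcripts, invented purely for illustration.
reference = "今天天气很好"
baseline_hyp = "今天天汽很号"  # two substituted characters
adapted_hyp = "今天天气很号"   # one substituted character

baseline_cer = cer(reference, baseline_hyp)
adapted_cer = cer(reference, adapted_hyp)
print(f"baseline CER: {baseline_cer:.3f}")
print(f"adapted CER:  {adapted_cer:.3f}")
print(f"absolute reduction: {absolute_reduction(baseline_cer, adapted_cer):.3f}")
print(f"relative reduction: {relative_reduction(baseline_cer, adapted_cer):.3f}")
```

In this toy case the absolute reduction is the plain difference between the two rates, while the relative reduction divides that difference by the baseline rate, so the two figures can differ widely for the same pair of systems.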

Journal: Journal of Computer Science and Technology
Publication Date: Jan 1, 2006
Citations: 25

Similar Papers

State-dependent phoneme-based model merging for dialectal Chinese speech recognition
Linquan Liu ... Wenhu Wu
Speech Communication | VOL. 50
07 May 2008

A new speech corpus of super-elderly Japanese for acoustic modeling
Meiko Fukuda ... Norihide Kitaoka
Computer Speech & Language | VOL. 77
24 Jun 2022

A hybrid CTC+Attention model based on end-to-end framework for multilingual speech recognition
Sendong Liang ... Wei Qi Yan
Multimedia Tools and Applications | VOL. 81
20 May 2022

Large vocabulary Taiwanese (Min-Nan) speech recognition using tone features and statistical pronunciation modeling
Dau-Cheng Lyu ... Yuang-Chin Chiang
01 Sep 2003
