Model Generation of Accented Speech using Model Transformation and Verification for Bilingual Speech Recognition

Han-Ping Shen,Chung-Hsien Wu,Pei-Shan Tsai

doi:10.1145/2661637

Abstract

Nowadays, bilingual or multilingual speech recognition is confronted with the accent-related problem caused by non-native speech in a variety of real-world applications. Accent modeling of non-native speech is definitely challenging, because the acoustic properties in highly-accented speech pronounced by non-native speakers are quite divergent. The aim of this study is to generate highly Mandarin-accented English models for speakers whose mother tongue is Mandarin. First, a two-stage, state-based verification method is proposed to extract the state-level, highly-accented speech segments automatically. Acoustic features and articulatory features are successively used for robust verification of the extracted speech segments. Second, Gaussian components of the highly-accented speech models are generated from the corresponding Gaussian components of the native speech models using a linear transformation function. A decision tree is constructed to categorize the transformation functions and used for transformation function retrieval to deal with the data sparseness problem. Third, a discrimination function is further applied to verify the generated accented acoustic models. Finally, the successfully verified accented English models are integrated into the native bilingual phone model set for Mandarin-English bilingual speech recognition. Experimental results show that the proposed approach can effectively alleviate recognition performance degradation due to accents and can obtain absolute improvements of 4.1%, 1.8%, and 2.7% in word accuracy for bilingual speech recognition compared to that using traditional ASR approaches, MAP-adapted, and MLLR-adapted ASR methods, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Model Generation of Accented Speech using Model Transformation and Verification for Bilingual Speech Recognition

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Apr 20, 2015
Citations: 1

Similar Papers

Bilingual Speech Recognition System for Isolated Words Using Deep Neural Network
B Bharathi ... S Sugapriya
-
B Bharathi, et. al.B Bharathi ... S Sugapriya
01 Feb 2018
01 Feb 2018

Phone modeling and combining discriminative training for mandarinenglish bilingual speech recognition
Yanmin Qian ... Jia Liu
-
Yanmin Qian, et. al.Yanmin Qian ... Jia Liu
01 Jan 2009
01 Jan 2009

Multi-pronounciation dictionary construction for Mandarin-English bilingual phrase speech recognition system
C Wang ... W Shi
-
C Wang, et. al.C Wang ... W Shi
01 Jul 2015
01 Jul 2015

Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization
Brian Yan ... Meng Yu
-
Brian Yan, et. al.Brian Yan ... Meng Yu
23 May 2022
23 May 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Model Generation of Accented Speech using Model Transformation and Verification for Bilingual Speech Recognition

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing