Abstract

In this paper, we propose a hybrid model adaptation approach in which pronunciation and acoustic models are adapted by incorporating the pronunciation and acoustic variabilities of non-native speech in order to improve the performance of non-native automatic speech recognition (ASR). Specifically, the proposed hybrid model adaptation can be performed at either the state-tying or the triphone-modeling level, depending on the level at which acoustic model adaptation is performed. In both methods, we first analyze the pronunciation variant rules of non-native speakers and then classify each rule as either a pronunciation variant or an acoustic variant. The state-tying level hybrid method then adapts the pronunciation models by accommodating the pronunciation variants in the pronunciation dictionary, and adapts the acoustic models by clustering the states of the triphone acoustic models using the acoustic variants. The triphone-modeling level hybrid method initially adapts the pronunciation models in the same way; for the acoustic model adaptation, however, the triphone acoustic models are re-estimated based on the adapted pronunciation models, and the states of the re-estimated triphone acoustic models are then clustered using the acoustic variants. Experiments on Korean-spoken English speech recognition show that ASR systems employing the state-tying and triphone-modeling level adaptation methods achieve relative reductions in the average word error rate (WER) of 17.1% and 22.1%, respectively, for non-native speech when compared to a baseline ASR system.
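For illustration only, the following is a minimal Python sketch of the pronunciation-model adaptation step summarized above: partitioning analyzed variant rules into pronunciation variants and acoustic variants, and expanding a pronunciation dictionary with the pronunciation variants. The rule format, the function names (`split_variant_rules`, `adapt_dictionary`), and the toy phone mappings are assumptions for exposition, not the paper's actual implementation.

```python
# Hypothetical sketch (not from the paper): classify variant rules and expand
# a pronunciation dictionary with the rules judged to be pronunciation variants.
from collections import defaultdict


def split_variant_rules(rules, pron_rule_ids):
    """Partition variant rules into pronunciation variants and acoustic variants.

    `rules` maps a rule id to a (canonical_phone, observed_phone) pair;
    `pron_rule_ids` is the set of rule ids judged to be pronunciation variants
    (the remaining rules are treated as acoustic variants, to be used later
    for clustering triphone states).
    """
    pron_variants, acoustic_variants = {}, {}
    for rule_id, mapping in rules.items():
        (pron_variants if rule_id in pron_rule_ids else acoustic_variants)[rule_id] = mapping
    return pron_variants, acoustic_variants


def adapt_dictionary(dictionary, pron_variants):
    """Add alternative pronunciations generated by the pronunciation-variant rules.

    For simplicity this uses plain substring substitution on the space-joined
    phone string; a real implementation would match whole phones only.
    """
    adapted = defaultdict(list)
    for word, prons in dictionary.items():
        adapted[word].extend(prons)
        for pron in prons:
            joined = " ".join(pron)
            for canonical, observed in pron_variants.values():
                if canonical in joined:
                    variant = joined.replace(canonical, observed).split()
                    if variant not in adapted[word]:
                        adapted[word].append(variant)
    return dict(adapted)


# Toy example with assumed canonical -> observed phone mappings.
base_dict = {"rice": [["r", "ay", "s"]]}
rules = {0: ("r", "l"), 1: ("th", "s")}
pron, acoust = split_variant_rules(rules, pron_rule_ids={0})
print(adapt_dictionary(base_dict, pron))  # adds the ["l", "ay", "s"] variant
```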
