Abstract

In this paper, we propose a pronunciation variation modeling method that improves the performance of a non-native automatic speech recognition (ASR) system without degrading the performance of a native ASR system. The proposed method is based on an indirect data-driven approach, in which pronunciation variability is investigated from the training speech data and variant rules are subsequently derived and applied to the ASR pronunciation dictionary to compensate for that variability. To this end, native utterances are first recognized using a phoneme recognizer, and the variant phoneme patterns of native speech are then obtained by aligning the recognized and reference phonetic sequences. The reference sequences are transcribed using each of canonical, knowledge-based, and hand-labeled methods. Similarly, the variant phoneme patterns of non-native speech are obtained by recognizing non-native utterances and comparing the recognized phoneme sequences with the reference phonetic transcriptions. Finally, variant rules are derived from the native and non-native variant phoneme patterns using decision trees and applied to adapt the dictionaries of the non-native and native ASR systems. In this paper, Korean spoken by native Chinese speakers is considered as the non-native speech. Non-native ASR experiments show that an ASR system using the dictionary constructed by the proposed pronunciation variation modeling method reduces the average word error rate (WER) by a relative 18.5% compared to the baseline ASR system using a canonically transcribed dictionary. In addition, the WER of a native ASR system using the proposed dictionary is reduced by a relative 1.1% compared to the baseline native ASR system with a canonically constructed dictionary.
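
To illustrate the pattern-extraction step described above, the following is a minimal sketch of how variant phoneme patterns can be collected by aligning a recognized phoneme sequence with its reference transcription. It is not the authors' implementation: the alignment method (difflib.SequenceMatcher), the romanized phone labels, and the helper name variant_patterns are illustrative assumptions, and a real system would use a phonetic edit-distance alignment over the full training corpus before deriving decision-tree rules.

    # Illustrative sketch: extract variant phoneme patterns by aligning a
    # recognized phoneme sequence against its reference transcription.
    # Phone labels and helper names are hypothetical, not from the paper.
    from collections import Counter
    from difflib import SequenceMatcher

    def variant_patterns(reference, recognized):
        """Return (reference_phone, recognized_phone) pairs where they differ."""
        patterns = []
        matcher = SequenceMatcher(a=reference, b=recognized, autojunk=False)
        for op, i1, i2, j1, j2 in matcher.get_opcodes():
            if op == "replace":      # substitution: reference phone realized differently
                # unequal-length replace spans are truncated here for simplicity
                for ref_p, rec_p in zip(reference[i1:i2], recognized[j1:j2]):
                    patterns.append((ref_p, rec_p))
            elif op == "delete":     # deletion: reference phone missing in recognition
                patterns.extend((ref_p, "-") for ref_p in reference[i1:i2])
            elif op == "insert":     # insertion: extra phone in recognition output
                patterns.extend(("-", rec_p) for rec_p in recognized[j1:j2])
        return patterns

    # Toy example with hypothetical romanized Korean phone labels.
    reference  = ["k", "a", "ng", "n", "a", "m"]
    recognized = ["k", "a", "n",  "n", "a", "m"]   # "ng" realized as "n"
    counts = Counter(variant_patterns(reference, recognized))
    print(counts.most_common())    # [(('ng', 'n'), 1)]

Aggregated over many utterances, such counts of (reference, recognized) phone pairs are the kind of statistics from which context-dependent variant rules could be derived (e.g., with decision trees, as in the paper) and then used to add pronunciation variants to the ASR dictionary.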
