A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models

Si Wei,Guoping Hu,Yu Hu,Ren-Hua Wang

doi:10.1016/j.specom.2009.03.004

Abstract

This paper presents two new ideas for text dependent mispronunciation detection. Firstly, mispronunciation detection is formulated as a classification problem to integrate various predictive features. A Support Vector Machine (SVM) is used as the classifier and the log-likelihood ratios between all the acoustic models and the model corresponding to the given text are employed as features for the classifier. Secondly, Pronunciation Space Models (PSMs) are proposed to enhance the discriminative capability of the acoustic models for pronunciation variations. In PSMs, each phone is modeled with several parallel acoustic models to represent pronunciation variations of that phone at different proficiency levels, and an unsupervised method is proposed for the construction of the PSMs. Experiments on a database consisting of more than 500,000 Mandarin syllables collected from 1335 Chinese speakers show that the proposed methods can significantly outperform the traditional posterior probability based method. The overall recall rates for the 13 most frequently mispronounced phones increase from 17.2%, 7.6% and 0% to 58.3%, 44.3% and 29.5% at three precision levels of 60%, 70% and 80%, respectively. The improvement is also demonstrated by a subjective experiment with 30 subjects, in which 53.3% of the subjects think the proposed method is better than the traditional one and 23.3% of them think that the two methods are comparable.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Mar 28, 2009
Citations: 82

Similar Papers

Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers
Wenping Hu ... Yong Wang
Speech Communication | VOL. 67
Wenping Hu, et. al.Wenping Hu ... Yong Wang
08 Jan 2015
Speech Communication | VOL. 67

Improving Mispronunciation Detection of Arabic Words for Non-Native Learners Using Deep Convolutional Neural Network Features
Shamila Akhtar ... Naveed Khan Baloch
Electronics | VOL. 9
Shamila Akhtar, et. al.Shamila Akhtar ... Naveed Khan Baloch
09 Jun 2020
Electronics | VOL. 9

An Arabic Mispronunciation Detection System Based on the Frequency of Mistakes for Asian Speakers
Faria Nazir ... Muazzam Maqsood
Mehran University Research Journal of Engineering and Technology | VOL. 40
Faria Nazir, et. al.Faria Nazir ... Muazzam Maqsood
01 Apr 2021
Mehran University Research Journal of Engineering and Technology | VOL. 40

A transfer learning approach to goodness of pronunciation based automatic mispronunciation detection.
Hao Huang ... Ying Hu
The Journal of the Acoustical Society of America | VOL. 142
Hao Huang, et. al.Hao Huang ... Ying Hu
01 Nov 2017
The Journal of the Acoustical Society of America | VOL. 142

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A new method for mispronunciation detection using Support Vector Machine based on Pronunciation Space Models

Abstract

Talk to us

Similar Papers

More From: Speech Communication