A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction

Chu-Cheng Lin,Richard Tzong-Han Tsai

doi:10.1109/tasl.2011.2172424

Abstract

Most spoken Chinese dialects lack comprehensive digital pronunciation databases, which are crucial for speech processing tasks. Given complete pronunciation databases for related dialects, one can use supervised learning techniques to predict a Chinese character's pronunciation in a target dialect based on the character's features and its pronunciation in other related dialects. Unfortunately, Chinese dialect pronunciation databases are far from complete. We propose a novel generative model that makes use of both existing dialect pronunciation data plus medieval rime books to discover patterns that exist in multiple dialects. The proposed model can augment missing dialectal pronunciations based on existing dialect pronunciation tables (even if incomplete) and the pronunciation data in rime books. The augmented pronunciation database can then be used in supervised learning settings. We evaluate the prediction accuracy in terms of phonological features, such as tone, initial phoneme, final phoneme, etc. For each character, features are evaluated on the whole, overall pronunciation feature accuracy (OPFA). Our first experimental results show that adding features from dialectal pronunciation data to our baseline rime-book model dramatically improves OPFA using the support vector machine (SVM) model. In the second experiment, we compare the performance of the SVM model using phonological features from closely related dialects with that of the model using phonological features from non-closely related dialects. The experimental results show that using features from closely related dialects results in higher accuracy. In the third experiment, we show that using our proposed data augmentation model to fill in missing data can increase the SVM model's OPFA by up to 7.6%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: May 1, 2012
Citations: 3

Similar Papers

Traffic Volume Forecasting Model of Freeway Toll Stations During Holidays – An SVM Model
Xiaowei Hu ... Tianlin Wang
Promet - Traffic&Transportation | VOL. 34
Xiaowei Hu, et. al.Xiaowei Hu ... Tianlin Wang
15 Jun 2022
Promet - Traffic&Transportation | VOL. 34

مدل سازی پایداری خاکدانهها با استفاده از ماشینهای بردار پشتیبان و رگرسیون خطی چند متغیره
...
-
, et. al. ...
25 Apr 2015
25 Apr 2015

Comparative Analysis of ANN and SVM Models Combined with Wavelet Preprocess for Groundwater Depth Prediction
Ting Zhou ... Zhi Yang
Water | VOL. 9
Ting Zhou, et. al.Ting Zhou ... Zhi Yang
12 Oct 2017
Water | VOL. 9

Application of the Support Vector Machine on precipitation-runoff modelling in Fenhe River
Cai-Hong Hu ... Ze-Ning Wu
-
Cai-Hong Hu, et. al. Cai-Hong Hu ... Ze-Ning Wu
01 May 2011
01 May 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Generative Data Augmentation Model for Enhancing Chinese Dialect Pronunciation Prediction

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing