Speech Synthesis for Error Training Models in CALL

Xin Zhang,Guangguang Ma,Wenli Zhou,Weiping Ye,Jiping Wan,Tin Shing Chiu,Qin Lu,Qiao Li

doi:10.1007/978-3-642-00831-3_24

Abstract

A computer assisted pronunciation teaching system (CAPT) is a fundamental component in a computer assisted language learning system (CALL). A speech recognition based CAPT system often requires a large amount of speech data to train the incorrect phone models in its speech recognizer. But collecting incorrectly pronounced speech data is a labor intensive and costly work. This paper reports an effort on training the incorrect phone models by making use of synthesized speech data. A special formant speech synthesizer is designed to filter the correctly pronounced phones into incorrect phones by modifying the formant frequencies. In a Chinese Putonghua CALL system for native Cantonese speakers to learn Mandarin, a small experimental CAPT system is built with a synthetic speech data trained recognizer. Evaluation shows that a CAPT system using synthesized data can perform as good as or even better than that using real data provided that the size of the synthetic data are large enough.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech Synthesis for Error Training Models in CALL

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker Embedding Extraction with Virtual Phonetic Information
S Sreekanth ... K Sri Rama Murty
-
S Sreekanth, et. al.S Sreekanth ... K Sri Rama Murty
01 Nov 2019
01 Nov 2019

Utterance-Based Selective Training for the Automatic Creation of Task-Dependent Acoustic Models
T Cincarek
IEICE Transactions on Information and Systems | VOL. E89-D
T CincarekT Cincarek
01 Mar 2006
IEICE Transactions on Information and Systems | VOL. E89-D

Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework
Johanes Effendi ... Satoshi Nakamura
-
Johanes Effendi, et. al.Johanes Effendi ... Satoshi Nakamura
25 Oct 2020
25 Oct 2020

Spoken language resources for Cantonese speech processing
Tan Lee ... Helen Meng
Speech Communication | VOL. 36
Tan Lee, et. al.Tan Lee ... Helen Meng
07 Jan 2002
Speech Communication | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech Synthesis for Error Training Models in CALL

Abstract

Talk to us

Similar Papers