Converting Foreign Accent Speech Without a Reference

Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna

doi:10.1109/taslp.2021.3060813


Open Access

https://doi.org/10.1109/taslp.2021.3060813


Abstract

Foreign accent conversion (FAC) is the problem of generating a synthetic voice that has the voice identity of a second-language (L2) learner and the pronunciation patterns of a native (L1) speaker. This synthetic voice has been referred to as a “golden-speaker” in the pronunciation-training literature. FAC is generally achieved by building a voice-conversion model that maps utterances from a source (L1) speaker onto the target (L2) speaker. As such, FAC requires that a reference utterance from the L1 speaker be available at synthesis time. This greatly restricts the application scope of the FAC system. In this work, we propose a “reference-free” FAC system that eliminates the need for reference L1 utterances at synthesis time, and transforms L2 utterances directly. The system is trained in two steps. First, a conventional FAC procedure is used to create a golden-speaker using utterances from a reference L1 speaker (which are then discarded) and the L2 speaker. Second, a pronunciation-correction model is trained to convert L2 utterances to match the golden-speaker utterances obtained in the first step. At synthesis time, the pronunciation-correction model directly transforms a novel L2 utterance into its golden-speaker counterpart. Our results show that the system reduces foreign accents in novel L2 utterances, achieving a 20.5% relative reduction in word-error-rate of an American English automatic speech recognizer and a 19% reduction in perceptual ratings of foreign accentedness obtained through listening tests. Over 73% of the listeners also rated golden-speaker utterances as having the same voice identity as the original L2 utterances.
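The two-step training procedure and reference-free synthesis described in the abstract can be sketched as a data flow. This is a toy illustration only, not the authors' implementation: utterances are represented as short lists of numbers, `toy_conventional_fac` is a hypothetical stand-in for the conventional FAC step, and the pronunciation-correction model is replaced by a simple least-squares line. All function names and values are assumptions made for illustration.

```python
from statistics import mean
from typing import List, Sequence, Tuple

Utterance = List[float]  # toy stand-in for one utterance's acoustic features


def toy_conventional_fac(l1: Utterance, l2: Utterance) -> Utterance:
    """Step 1 stand-in (hypothetical): build a 'golden-speaker' utterance.

    Keeps the L1 utterance's shape around its mean ("pronunciation") and
    shifts it to the L2 utterance's mean level ("voice identity").
    """
    l1_mean, l2_mean = mean(l1), mean(l2)
    return [x - l1_mean + l2_mean for x in l1]


def fit_correction(
    l2_utts: Sequence[Utterance], golden_utts: Sequence[Utterance]
) -> Tuple[float, float]:
    """Step 2 stand-in (hypothetical): fit golden ~= a * l2 + b.

    Assumes each L2 utterance is frame-aligned with its golden counterpart.
    """
    xs = [x for utt in l2_utts for x in utt]
    ys = [y for utt in golden_utts for y in utt]
    mx, my = mean(xs), mean(ys)
    var = sum((x - mx) ** 2 for x in xs)
    if var == 0:
        raise ValueError("L2 training frames have no variance")
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / var
    return a, my - a * mx


def correct(model: Tuple[float, float], l2_utt: Utterance) -> Utterance:
    """Synthesis: transform a novel L2 utterance; no L1 reference is needed."""
    a, b = model
    return [a * x + b for x in l2_utt]


# Training: parallel L1 reference and L2 utterances (made-up numbers).
l1_train = [[1.0, 2.0, 3.0], [2.0, 4.0, 3.0]]
l2_train = [[10.0, 12.0, 14.0], [9.0, 13.0, 11.0]]

# Step 1: create golden-speaker targets, then discard the L1 references.
golden_train = [toy_conventional_fac(l1, l2) for l1, l2 in zip(l1_train, l2_train)]
del l1_train

# Step 2: train the correction model on (L2, golden-speaker) pairs.
model = fit_correction(l2_train, golden_train)

# Synthesis time: only the novel L2 utterance is required.
converted = correct(model, [11.0, 15.0, 12.0])
```

The point of the sketch is the dependency structure: the L1 reference appears only in step 1, so `correct` can be applied to any new L2 utterance on its own.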


Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publication Date: Jan 1, 2021
Citations: 9
License type: publisher-specific, author manuscript


