Abstract
Synthesizing fluent code-switched (CS) speech with a consistent voice using only monolingual corpora remains challenging, since language alternation seldom occurs in the training data and speaker identity is directly correlated with language. In this paper, we present a bilingual phonetic posteriorgram (PPG) based CS speech synthesizer trained only on monolingual corpora. The bilingual PPG, formed by stacking two monolingual PPGs extracted from two monolingual speaker-independent speech recognition systems, is used to bridge across speakers and languages. We assume that the bilingual PPG represents the articulation of speech sounds in a speaker-independent manner and captures accurate phonetic information of both languages in the same feature space. The proposed model first extracts bilingual PPGs from the training data. An encoder-decoder based model then learns the relationship between input text and bilingual PPGs, and the bilingual PPGs are mapped to acoustic features by a bidirectional long short-term memory based model conditioned on a speaker embedding that controls speaker identity. Experiments validate the effectiveness of the proposed model in terms of speech intelligibility, audio fidelity, and speaker consistency of the generated code-switched speech.
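The core feature construction described above, stacking two monolingual PPGs into one bilingual PPG, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the frame count, the two phone-set sizes, and the random posteriors stand in for the per-frame outputs of the two speaker-independent recognizers.

```python
import numpy as np

# Illustrative sizes (not from the paper): T frames, two monolingual phone sets.
T = 100
N_L1, N_L2 = 40, 60

rng = np.random.default_rng(0)

def softmax(x, axis=-1):
    """Row-wise softmax, so each frame is a valid posterior distribution."""
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

# Stand-ins for per-frame phonetic posteriors from each monolingual,
# speaker-independent ASR system (shape: frames x phone classes).
ppg_l1 = softmax(rng.normal(size=(T, N_L1)))
ppg_l2 = softmax(rng.normal(size=(T, N_L2)))

# Bilingual PPG: concatenate the two monolingual PPGs frame by frame,
# placing the phonetic information of both languages in one feature space.
bilingual_ppg = np.concatenate([ppg_l1, ppg_l2], axis=1)
# bilingual_ppg has shape (T, N_L1 + N_L2)
```

In the full system, this bilingual PPG sequence is the intermediate representation: the encoder-decoder model predicts it from text, and the speaker-conditioned BLSTM maps it to acoustic features.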