Language-independent acoustic cloning of HTS voices

Carmen Magariños,Daniel Erro,Eduardo R Banga

doi:10.1016/j.csl.2018.12.006

Abstract

Speaker adaptation techniques can be classified as intra-lingual or cross-lingual depending on whether or not the source model and the target speaker employ the same language. Most of the work in this field has been focused on the first case, while the second one has been less explored. In this paper we address the cross-lingual paradigm in the framework of a HMM-based speech synthesis system by further developing a formerly proposed approach. This method is able to clone a given speaker into a different language by combining the linguistic structure and the acoustic characteristics of two HTS models. In this work, we discuss the extension of the adaptation procedure to some other source model parameters that were kept unmodified in the initial version, and compare the performance of both versions by means of subjective and objective tests. These results are also contrasted with those obtained by a KLD-based technique proposed in the literature for a similar purpose. While no significant preference for any of the versions of our method is observed, our approach clearly outperforms the KLD-based technique.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Language-independent acoustic cloning of HTS voices

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Similar Papers

HMM-based Indonesian speech synthesis system with declarative and question sentences intonation
Elok Cahyaningtyas ... Dhany Arifianto
-
Elok Cahyaningtyas, et. al.Elok Cahyaningtyas ... Dhany Arifianto
01 Nov 2015
01 Nov 2015

The Writing Skill of 3th Grade Students of Sibulue Subdistrict Junior High School of Bone Regency
Rukayah Rukayah
International Journal of Linguistics | VOL. 6
Rukayah RukayahRukayah Rukayah
29 Apr 2014
International Journal of Linguistics | VOL. 6

TH‐E‐224C‐01: Generic Source Models for Commonly Used Clinical Accelerator Beams for Monte Carlo Treatment Planning
J Fan ... L Chen
Medical Physics | VOL. 33
J Fan, et. al.J Fan ... L Chen
01 Jun 2006
TH‐E‐224C‐01: Generic Source Models for Commonly Used Clinical Accelerator Beams for Monte Carlo Treatment Planning
J Fan ... L Chen

The Tyranny of Small Differences: The Culpability Gulf between the Subjective and Objective Tests for Extended Joint Criminal Enterprise in Australia
Laura Stockdale
SSRN Electronic Journal | VOL. -
Laura StockdaleLaura Stockdale
20 Mar 2015
SSRN Electronic Journal | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language-independent acoustic cloning of HTS voices

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language