Average Voice Modeling Based on Unbiased Decision Trees

Fahimeh Bahmaninezhad, Hossein Sameti, Soheil Khorram

doi:10.1007/978-3-642-38847-7_12

Abstract

Speaker-adaptive speech synthesis based on the hidden semi-Markov model (HSMM) has been shown to be highly effective when only a limited amount of speech data is available. Its effectiveness can be increased further by training the average voice model appropriately. This study therefore presents a new method for training the average voice model, one that guarantees that data from every speaker contributes to all the leaves of the decision tree. Small amounts of training data and highly diverse contexts across the training speakers degrade the quality of the average voice model considerably, and in turn harm the adapted model and the synthetic speech. The proposed method takes these difficulties into account in order to train a high-quality average voice model. As the experiments indicate, the proposed method outperforms the conventional one both in the quality of the synthetic speech and in its similarity to the natural voice. In the CMOS test, the proposed method scores 0.6 higher than the conventional one.
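The guarantee that every speaker contributes to every leaf can be pictured as an admissibility check on candidate splits during tree construction. The sketch below is only one possible reading of that constraint, not the authors' algorithm: the function names, the `(speaker_id, context)` frame representation, and the rule "reject a split if either child loses a speaker" are all assumptions made for illustration.

```python
def speakers_covered(frames, all_speakers):
    """Return True if every speaker in all_speakers has at least one frame."""
    present = {speaker for speaker, _context in frames}
    return all_speakers <= present


def split_is_admissible(frames, question, all_speakers):
    """Hypothetical check: accept a context question only if both child
    nodes still contain data from every training speaker.

    frames       -- list of (speaker_id, context) pairs at the current node
    question     -- callable mapping a context to True/False
    all_speakers -- set of all training speaker ids
    """
    yes_child = [f for f in frames if question(f[1])]
    no_child = [f for f in frames if not question(f[1])]
    return (speakers_covered(yes_child, all_speakers)
            and speakers_covered(no_child, all_speakers))


# Toy data: speaker "s2" has no non-vowel frames, so the split is rejected.
speakers = {"s1", "s2"}
is_vowel = lambda context: context["vowel"]
node = [
    ("s1", {"vowel": True}),
    ("s1", {"vowel": False}),
    ("s2", {"vowel": True}),
]
rejected = split_is_admissible(node, is_vowel, speakers)

# Once "s2" also supplies a non-vowel frame, the same split is accepted.
accepted = split_is_admissible(node + [("s2", {"vowel": False})], is_vowel, speakers)
```

Under this reading, a question that would leave some leaf without data from one of the speakers is never used, so no leaf of the resulting tree is estimated from a biased subset of speakers.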

Similar Papers

A training method for average voice model based on shared decision tree context clustering and speaker adaptive training
J Yamagishi ... K Tokuda
06 Apr 2003

Cluster adaptive training of average voice models
Vincent Wan ... Javier Latorre
01 May 2014

Building HMM-TTS Voices on Diverse Data
Vincent Wan ... Norbert Braunschweiler
IEEE Journal of Selected Topics in Signal Processing | VOL. 8
01 Apr 2014

Evaluation of speaker-dependent and average-voice Vietnamese statistical speech synthesis systems
Duy Ninh Khánh
Journal of Science and Technology Issue on Information and Communications Technology | VOL. 17
31 Dec 2020
