Minimum unit selection error training for HMM-based unit selection speech synthesis system

Zhen-Hua Ling, Ren-Hua Wang

doi:10.1109/icassp.2008.4518518


Abstract


This paper presents a minimum unit selection error (MUSE) training method for an HMM-based unit selection speech synthesis system, which selects the optimal phone-sized unit sequence from the speech database by maximizing the combined likelihood of a group of trained HMMs. Under the MUSE criterion, the weights and distribution parameters of these HMMs are estimated to minimize the number of units that differ between the selected phone sequences and the natural phone sequences for the training sentences. The optimization is realized by discriminative training using the generalized probabilistic descent (GPD) algorithm. Experimental results show that the proposed method improves on the performance of the baseline system, in which model weights are set manually and distribution parameters are trained under the maximum likelihood criterion.
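The quantity that MUSE training minimizes can be illustrated with a small sketch. The code below is not the authors' implementation: the function names, the toy log-likelihood values, and the two-model setup are all hypothetical, and units are chosen independently at each phone position, whereas the system described above selects a whole unit sequence. It only shows how a weighted combination of per-model log-likelihoods picks units and how the unit selection error is counted against the natural sequence. The GPD update itself is omitted; GPD generally operates on a smoothed, differentiable version of such an error count.

```python
from typing import Dict, List, Sequence


def combined_score(log_likelihoods: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum of per-model log-likelihoods for one candidate unit."""
    return sum(w * ll for w, ll in zip(weights, log_likelihoods))


def select_units(
    candidates: List[Dict[str, List[float]]], weights: Sequence[float]
) -> List[str]:
    """Pick the highest-scoring candidate at each phone position.

    Simplifying assumption: positions are scored independently, so no
    sequence-level search is performed.
    """
    selected = []
    for position in candidates:
        best_id = max(position, key=lambda uid: combined_score(position[uid], weights))
        selected.append(best_id)
    return selected


def unit_selection_error(selected: Sequence[str], natural: Sequence[str]) -> int:
    """Number of positions where the selected unit differs from the natural one."""
    return sum(1 for s, n in zip(selected, natural) if s != n)


# Hypothetical toy data: two phone positions, two candidate units each,
# and two models contributing one log-likelihood per candidate.
candidates = [
    {"a1": [-1.0, -5.0], "a2": [-2.0, -1.0]},
    {"b1": [-0.5, -3.0], "b2": [-3.0, -1.0]},
]
natural = ["a1", "b1"]

equal_weights = (1.0, 1.0)   # stand-in for manually set weights
tuned_weights = (1.0, 0.1)   # stand-in for weights after training

errors_equal = unit_selection_error(select_units(candidates, equal_weights), natural)
errors_tuned = unit_selection_error(select_units(candidates, tuned_weights), natural)
print(errors_equal, errors_tuned)
```

In this toy case, down-weighting the second model changes the unit chosen at the first position from `a2` to the natural unit `a1`, which is the kind of effect that training the weights is meant to achieve.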
