Multi-Layer F0 Modeling for HMM-Based Speech Synthesis

Cheng-Cheng Wang,Zhen-Hua Ling,Bu-Fan Zhang,Li-Rong Dai

doi:10.1109/chinsl.2008.ecp.44

Multi-Layer F0 Modeling for HMM-Based Speech Synthesis

Cheng-Cheng Wang, Zhen-Hua Ling + Show 2 more

Open Access

https://doi.org/10.1109/chinsl.2008.ecp.44

Copy DOI

Publication Date: Dec 1, 2008

Citations: 29

Affiliation: University of Science and Technology of China

#FO Models #Dependent Phoneme + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper proposes a two-layer fundamental frequency (FO) modeling method for HMM-based parametric speech synthesis. The FO models are trained for each context- dependent phoneme in the conventional HMM-based speech synthesis system. Considering the super-segmental characteristics of FO features, an explicit syllable-layer FO model is introduced in this paper. At synthesis stage, the FO contour is generated by maximizing the combined likelihood functions of the phone-layer and syllable-layer FO models. The objective and subjective evaluation results in our experiments show that the proposed multi-layer FO modeling method can improve the performance of FO prediction for emotional speech synthesis.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.