Automatic generation of prosodic structure for high quality Mandarin speech synthesis

Fu-Chiang Chou Fu-Chiang Chou,Lin-Shan Lee Lin-Shan Lee,Chiu-Yu Tseng Chiu-Yu Tseng

doi:10.1109/icslp.1996.607935

Automatic generation of prosodic structure for high quality Mandarin speech synthesis

Fu-Chiang Chou Fu-Chiang Chou, Lin-Shan Lee Lin-Shan Lee + Show 1 more

Open Access

https://doi.org/10.1109/icslp.1996.607935

Copy DOI

Publication Date: Oct 3, 1996

Citations: 26

Affiliation: Institute of Information Science, Academia Sinica, National Taiwan University, Institute of History and Philology, Academia Sinica

#Prosodic Structure #Speech Synthesis Technology + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

A key problem for today's speech synthesis technology is to automatically generate an appropriate hierarchical prosodic structure for text input and incorporate it into synthesized speech. The paper presents a method for such a problem in Mandarin Chinese. This method uses a speech database for the training of a statistical model to generate the prosodic structure and determine prosodic parameters such as syllable duration, pause, energy and intonation. The experimental results show that an accuracy of 83.1% in the prediction of prosodic structure can be achieved. Furthermore, a Chinese text-to-speech system can be developed based on the proposed prosodic structure.

Full Text