Abstract

An overview of a speech synthesis system is given, with a special emphasis on hierachical structure to generate speech sounds from an input text. A speech synthesis system generally consisits of four phases: linguistic analysis of a text, phonological/phonetic processing, synthesis parameter generation, and sound synthesis. A statistical model to represent durations of phone segments and a linear time invariant system model to generate fundamental frequency contours are shown to have been successful in improving sound quality of synthetic speech sounds.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call