Abstract
This paper presents a text-to-speech (TTS) system, capable of synthesis of continuous Slovenian speech. The system is based on the concatenation of basic speech units, diphones, using TD-PSOLA technique improved with a variable length linear interpolation process. Input text is processed by a series of modules which are described in detail. A special attention is given to modeling the F0 contour, mainly based on the so-called superpositional approach. This system is experimentally used in an employment agent EMA that provides employment information through the Internet.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have