Abstract

This paper presents a text-to-speech (TTS) system, capable of synthesis of continuous Slovenian speech. The system is based on the concatenation of basic speech units, diphones, using TD-PSOLA technique improved with a variable length linear interpolation process. Input text is processed by a series of modules which are described in detail. A special attention is given to modeling the F0 contour, mainly based on the so-called superpositional approach. This system is experimentally used in an employment agent EMA that provides employment information through the Internet.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call