Abstract

English sentences are assigned to a small number of intonational classes. Each class has an associated fundamental frequency template, which characterizes the F0 contour by specifying up to two frequency points per word—typically a peak frequency and a final frequency. The F0 contour for a particular sentence is elaborated by cubic polynomial interpolation of the template's frequency points, fixed in time by the actual durations of the words of the sentence. Similar sentences with different numbers of words can be assigned to the same intonational class if short phrases are described by a single pair of frequency points, which are then algorithmically elaborated. In a practical application the name of a template is stored with the text of a sentence, and the spoken sentence is synthesized by applying the template to the sequence of LPC-encoded words in real time. A demonstration tape will be played.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.