Abstract

Multipulse linear prediction is a coding technique that can provide extremely high-quality speech synthesis [B. S. Atal and J. R. Remde, IEEE Proc. ICASSP82, 614–617 (1982)]. It is therefore of interest to examine whether the technique can provide correspondingly good quality in a text‐to‐speech system. In such a system, a prosodic contour is imposed on a set of concatenated speech units. Various speech units (e.g., words and diphones) have been tried. First results suggest that the word is the appropriate unit for such a system. The techniques for pitch and timing alteration and for speech unit concatenation, and their effects on the resulting synthetic speech, will be discussed and demonstrated.
