Abstract

Unit selection speech systems generate synthetic speech by concatenation of acoustic units that are chosen by means of a Viterbi search on a previously recorded corpus of the same speaker. Given the finite nature of the corpus, there is usually a trade-off between the acoustic quality of the signal, and the quality of the prosody. Proposed is a parallel Viterbi search considering multiple intonation contours that improves at the same time both acoustic quality and intonation, while maintaining the computational efficiency of the process.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.