Abstract

Back off techniques are employed in syllable based unit selection speech synthesis systems to maintain the naturalness of the speech in spite of the missing syllables. In synthesizing the missing complex consonant clusters syllables of Telugu, we introduced reduced vowel epenthesis as a rule-based backoff strategy[1]. In this paper, we refine the scope of the approach in selectively applying vowel epenthesis only in cases of sonority rise between adjacent consonants. When the sonority does not rise (stop-stop, liquid-stop clusters), we increase the duration of the consonant. Owing to specific patterns of vowel epenthesis observed in languages, we conduct a subjective evaluation to determine the identity of the epenthetic vowel in Hindi. From the inferences of the listening test, we devise a class based rule to perform epenthesis. Further, to evaluate the performance of the designed system, we perform both subjective as well as an objective evaluation based on confidence measures from an ASR system. We conduct a phone level automatic speech recognition task on the intelligibility of the words synthesized using epenthesis as a cluster-repair strategy. The results show that the proposed back off method helps in producing more natural-sounding speech compared to the conventional backoffs.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.