Abstract
This work presents a set of rule-based algorithms for deciding the pronunciation of homographs in a Brazilian Portuguese (BP) text-to-speech (TTS) system. The proposed approach comprises a morphosyntactic analysis, which handles homographs that belong to different parts of speech (POS), and a semantic analysis, which handles homographs that belong to the same POS. The algorithms were implemented to resolve ambiguities for 111 homograph pairs, organized into 23 disambiguation algorithms, and were tested on three types of text: news, the Bible, and literature. Computer experiments showed that the correct homograph pronunciation is obtained in 99.00% of occurrences.
Highlights
In text-to-speech (TTS) systems, deciding the pronunciation of heterophonic homographs is a nontrivial problem
Homographs usually represent a small percentage of the analyzed text, but in speech synthesis mistaken phonetic transcriptions lead to a poor evaluation of the TTS system, even if they occur only a few times
The proposed approach comprises a morphosyntactic analysis, which handles homographs that belong to different POS, and a semantic analysis, which handles homographs that belong to the same POS
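The two-stage idea can be illustrated with a minimal Python sketch. It is not the paper's implementation: the cue lists, function names, and context window below are hypothetical and chosen only for illustration, and the 23 disambiguation algorithms are not reproduced. The sketch uses two well-known BP homographs: "gosto" (noun "taste", closed vowel, versus verb "I like", open vowel), which differs in POS and is handled with neighboring-word cues, and "sede" (noun "thirst", closed vowel, versus noun "headquarters", open vowel), which shares a POS and is handled with semantic context cues.

```python
import re

OPEN, CLOSED = "open", "closed"  # quality of the stressed vowel

# Hypothetical cue lists, for illustration only.
NOUN_CUES = {"o", "um", "do", "no", "ao", "meu", "seu", "bom", "mau"}
VERB_CUES = {"eu", "não", "também"}
THIRST_CUES = {"água", "beber", "fome", "matar", "calor"}
HQ_CUES = {"empresa", "governo", "clube", "prédio", "nova"}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def disambiguate_gosto(tokens, i):
    """Morphosyntactic stage: the readings differ in POS (noun vs. verb)."""
    prev = tokens[i - 1] if i > 0 else ""
    nxt = tokens[i + 1] if i + 1 < len(tokens) else ""
    if prev in NOUN_CUES:  # determiner or adjective before it: noun
        return CLOSED
    if prev in VERB_CUES or nxt in {"de", "do", "da", "dos", "das"}:
        return OPEN  # subject pronoun before it or "gosto de": verb
    return CLOSED  # assumed default


def disambiguate_sede(tokens, i, window=4):
    """Semantic stage: both readings are nouns, so count context cues."""
    context = set(tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window])
    thirst = len(context & THIRST_CUES)
    headquarters = len(context & HQ_CUES)
    return OPEN if headquarters > thirst else CLOSED


RULES = {"gosto": disambiguate_gosto, "sede": disambiguate_sede}


def pronounce(text):
    """Return (homograph, vowel quality) for each known homograph in text."""
    tokens = tokenize(text)
    return [(t, RULES[t](tokens, i)) for i, t in enumerate(tokens) if t in RULES]


print(pronounce("Eu gosto de música"))                  # [('gosto', 'open')]
print(pronounce("A sede da empresa fica em Brasília"))  # [('sede', 'open')]
```

A lookup table keyed by homograph keeps each rule set independent, which loosely mirrors the idea of grouping homograph pairs under shared disambiguation algorithms; a real system would rely on a POS tagger and far richer rules than these toy cue lists.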