Abstract

This work presents a rule-based algorithm set used to decide the pronunciation of homographs applied to a Brazilian Portuguese (BP) text-to-speech (TTS) system. The proposed approach is composed of a morphosyntactic analysis, which deals with homographs that belong to different part-of-speech (POS), and a semantic analysis, which deals with homographs that belong to the same POS. The algorithms were implemented to solve ambiguities for 111 homograph pairs organized into 23 disambiguation algorithms, and tested with three types of texts: news, Bible and literature. Computer experiments showed that a correct homograph pronunciation is obtained in 99.00% of the occurrences.

Highlights

  • I N text-to-speech (TTS) systems, the decision on the pronunciation of heterophonic homographs is a nontrivial problem

  • The number of homographs usually represents a small percentage of the analyzed text, but in the context of speech synthesis, mistaken phonetic transcriptions produce a bad evaluation of the TTS system, even if it occurs in a small number of times

  • The proposed approach is composed of a morphosyntactic analysis, which deals with problems of homographs that belong to different POS, and a semantic analysis, which deals with problems of homographs that belong to the same POS

Read more

Summary

A Rule-Based Method for Homograph Disambiguation in Brazilian Portuguese

Resende Jr. Abstract— This work presents a rule-based algorithm set used to decide the pronunciation of homographs applied to a Brazilian Portuguese (BP) text-to-speech (TTS) system. The proposed approach is composed of a morphosyntactic analysis, which deals with homographs that belong to different part-of-speech (POS), and a semantic analysis, which deals with homographs that belong to the same POS. The algorithms were implemented to solve ambiguities for 111 homograph pairs organized into 23 disambiguation algorithms, and tested with three types of texts: news, Bible and literature. Computer experiments showed that a correct homograph pronunciation is obtained in 99.00% of the occurrences

INTRODUCTION
APPLIED METHODOLOGY
COMPUTER EXPERIMENTS
CONCLUSIONS
Findings
20: Go to Algorithm 5

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.