Objectives: Designing dynamic computer systems that are effective, efficient, simple, and satisfying to use is becoming extremely important in this age of information and communication technology. Text to Speech or Speech Synthesis is one of the many methods being investigated by researchers to improve Human-Computer Interaction. The goal here is to improve the text processing component of the Tamil voice synthesizer by including a text normalizer and loan word identification that is efficient and reliable. Methods: Text normalization is conducted on unconstrained Tamil text to turn non-standard terms into common words to reduce confusing utterances during intermediate word processing. Loan/Native words in Tamil literature are detected to enhance the Tamil voice synthesizer system’s pronunciation model. Findings: During normalization, non-standard Tamil words are replaced with standard ones to reduce ambiguous utterances during interim processing. A pronunciation model is built to improve the Tamil speech synthesizer system by identifying loan words in Tamil text. A syllable classifier is presented in this study, based on a decision list approach, which can handle various types of non-stationary sounds. Novelty: We also disclose a ’loan/native word classifier’ based on multiple linear regressions that perform well even with small words of three syllables. Such sophisticated text processors are required in today’s dominating Digital, Information-Communication Technology, and Human-Computer Interaction age. Keywords: Mobile Communication Technology; HumanComputer Interaction; Speech Synthesis Affirms; Syllable Classifier; Prerecorded database
Read full abstract