Abstract

Text normalization (TN) for a text-to-speech (TTS) synthesizer is the transformation of non-standard words, such as times, ordinal numbers, equations, ranges, and dates, into standard words that reflect their pronunciation. Text normalization is an essential part of every TTS synthesizer; without it, the speech generated by the synthesizer will be unintelligible. Because previous approaches have performed unsatisfactorily, this paper proposes a text normalization method for the Bangla language. First, we produce a dataset of tokens labeled with semiotic classes by applying regular expressions to a Bangla corpus. Then, we train an XGBClassifier model on these labeled tokens. Next, the trained model identifies the semiotic class of each token in a new Bangla text corpus. Finally, the method produces normalized text for each token by calling the function associated with its predicted class. This text normalization method will help Bangla TTS synthesizers produce more intelligible speech. The token classification accuracy of this method is 99.997%.
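The pipeline described above (regex-based labeling, class prediction, then a per-class normalization function) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the class names (TIME, RANGE, CARDINAL, PLAIN), the regular expressions, the whitespace tokenizer, and the digit-by-digit verbalizers are hypothetical and are not taken from the paper. The trained XGBClassifier is replaced here by a placeholder that simply reuses the regex labeler, so the sketch shows only the data flow, not the classifier itself.

```python
import re

# Bangla digits and their spoken forms (digit-by-digit reading only;
# a real verbalizer would produce full number words).
BN_DIGITS = "০১২৩৪৫৬৭৮৯"
DIGIT_WORDS = ["শূন্য", "এক", "দুই", "তিন", "চার", "পাঁচ", "ছয়", "সাত", "আট", "নয়"]

# Hypothetical semiotic classes and patterns, checked in order.
CLASS_PATTERNS = [
    ("TIME", re.compile(r"^[০-৯]{1,2}:[০-৯]{2}$")),
    ("RANGE", re.compile(r"^[০-৯]+-[০-৯]+$")),
    ("CARDINAL", re.compile(r"^[০-৯]+$")),
]


def label_token(token):
    """Return the first semiotic class whose pattern matches, else PLAIN."""
    for name, pattern in CLASS_PATTERNS:
        if pattern.match(token):
            return name
    return "PLAIN"


def build_dataset(text):
    """Step 1: tokenize on whitespace and attach a regex-derived class label."""
    return [(tok, label_token(tok)) for tok in text.split()]


def read_digits(token):
    """Spell out a string of Bangla digits one digit at a time."""
    return " ".join(DIGIT_WORDS[BN_DIGITS.index(ch)] for ch in token)


def normalize_time(token):
    return " ".join(read_digits(part) for part in token.split(":"))


def normalize_range(token):
    start, end = token.split("-")
    return f"{read_digits(start)} থেকে {read_digits(end)}"


# Step 4: one normalization function per class (hypothetical mapping).
CLASS_FUNCTIONS = {
    "TIME": normalize_time,
    "RANGE": normalize_range,
    "CARDINAL": read_digits,
    "PLAIN": lambda token: token,
}


def normalize(text, predict=label_token):
    """Steps 3-4: predict a class per token, then call that class's function.

    `predict` stands in for the trained classifier; by default it is the
    regex labeler, which is a placeholder and not the paper's model.
    """
    return " ".join(CLASS_FUNCTIONS[predict(tok)](tok) for tok in text.split())
```

In a fuller version, `build_dataset` would supply the training pairs for the classifier, and `predict` would wrap the trained model's prediction for a token's feature vector; how tokens are turned into features is not stated in the abstract and is left out here.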
