Text Analyses

Xu Tan

doi:10.1007/978-981-99-0827-1_4

Abstract

Through text analyses, we can transform input text into linguistic features, which contain rich information about pronunciation and prosody that can ease speech synthesis. Text analyses consist of several components: (1) text processing, which processes raw text from documents, normalizes the text from the written form into spoken form, and conducts some linguistic analyses; (2) phonetic analysis, which converts text into phonetic symbols, including polyphone disambiguation and grapheme-to-phoneme conversion; (3) prosodic analysis, which analyzes some prosodic features such as pitch, duration, loudness, stress, and pauses. In this chapter, we first introduce these components in the first three sections and then discuss the development of text analysis in TTS in the last section.

Full Text