A Unified Framework for Multilingual Text-to-Speech Synthesis with SSML Specification as Interface

Zhiyong Wu, Guangqi Cao, M Helen Meng, Lianhong Cai

doi:10.1016/s1007-0214(09)70127-0

Open Access

Journal: Tsinghua Science & Technology
Publication Date: Sep 30, 2009
Citations: 16

Affiliation: Tsinghua University, University of Hong Kong

Keywords: Speech Synthesis Markup Language, Unified Framework

Abstract

This paper describes the design of a unified framework for Crystal, a multilingual text-to-speech (TTS) synthesis engine. The framework defines the common TTS modules shared by different languages and/or dialects. The interfaces between consecutive modules conform to the Speech Synthesis Markup Language (SSML) specification, for standardization, interoperability, multilinguality, and extensibility. The module divisions and implementation technologies of the framework are described in detail, together with possible extensions for algorithm research and evaluation of TTS synthesis. An implementation of a mixed-language TTS system for Chinese Putonghua, Chinese Cantonese, and English demonstrates the feasibility of the proposed framework.
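
As an illustration of the idea that consecutive modules exchange SSML documents, the following is a minimal sketch in Python and not the Crystal implementation. The module names (`resolve_languages`, `insert_breaks`), the pipeline helper, and the sample document are all hypothetical; only the SSML namespace and the `speak`, `s`, `break`, and `xml:lang` constructs come from the SSML specification. Each module takes an SSML string and returns an SSML string, so modules can be reordered, replaced, or inspected independently.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"
# Serialize SSML elements without a prefix (default namespace).
ET.register_namespace("", SSML_NS)


def resolve_languages(ssml: str) -> str:
    """Hypothetical module: make the language of every <s> explicit."""
    root = ET.fromstring(ssml)
    default_lang = root.get(XML_LANG, "")
    for s in root.iter(f"{{{SSML_NS}}}s"):
        if s.get(XML_LANG) is None:
            s.set(XML_LANG, default_lang)
    return ET.tostring(root, encoding="unicode")


def insert_breaks(ssml: str) -> str:
    """Hypothetical module: append a <break/> at the end of every <s>."""
    root = ET.fromstring(ssml)
    for s in root.iter(f"{{{SSML_NS}}}s"):
        ET.SubElement(s, f"{{{SSML_NS}}}break", {"strength": "medium"})
    return ET.tostring(root, encoding="unicode")


def run_pipeline(ssml: str, modules) -> str:
    """Feed the SSML output of each module into the next one."""
    for module in modules:
        ssml = module(ssml)
    return ssml


def sentence_languages(ssml: str):
    """Read back (language, text) pairs, e.g. to route each sentence
    to a language-specific back end (assumed, not from the paper)."""
    root = ET.fromstring(ssml)
    return [
        (s.get(XML_LANG), (s.text or "").strip())
        for s in root.iter(f"{{{SSML_NS}}}s")
    ]


# Hypothetical mixed-language input: Putonghua, Cantonese, English.
SAMPLE = (
    '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
    'xml:lang="zh-CN">'
    "<s>你好</s>"
    '<s xml:lang="zh-HK">早晨</s>'
    '<s xml:lang="en-US">Hello</s>'
    "</speak>"
)

if __name__ == "__main__":
    result = run_pipeline(SAMPLE, [resolve_languages, insert_breaks])
    print(result)
    print(sentence_languages(result))
```

Because every intermediate result is itself an SSML document, the output of any stage in this sketch can be saved, validated, or handed to a different implementation of the next stage, which is the kind of interoperability the abstract attributes to SSML-conformant interfaces.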

Full Text