Scalable Concatenative Speech Synthesis Based on the Plural Unit Selection and Fusion Method

M Tamura,T Kagoshima,T Mizutani

doi:10.1109/icassp.2005.1415125

Scalable Concatenative Speech Synthesis Based on the Plural Unit Selection and Fusion Method

M Tamura, T Kagoshima + Show 1 more

https://doi.org/10.1109/icassp.2005.1415125

Copy DOI

Publication Date: Mar 18, 2005

Citations: 8

Affiliation: Toshiba (Japan)

#Concatenative Speech Synthesizer #Unit Selection Method + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Recently, concatenative speech synthesizers with large databases have been widely developed for high-quality speech synthesis. However, some platforms require a speech synthesis system that can work under the limitation of memory footprint or computational cost. In this paper, we propose a scalable concatenative speech synthesizer based on the plural speech unit selection and fusion method. To realize scalability, we propose the offline unit fusion method in which pitch-cycle waveforms for voiced segments are fused in advance. The experimental results show that the synthetic speech of the offline unit fusion method with half-size waveform database is comparable to that of the online unit fusion method, while the computation cost is reduced to 1/10.

Full Text