A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction

S Takano,H Mizuno,M Abe,S Nakajima,K Tanaka

doi:10.1109/89.890065

Abstract

This paper proposes a new text-to-speech (TTS) system that utilizes large numbers of speech segments to produce very natural and intelligible synthetic speech. There are two innovations; new multiform synthesis units and a new speech modification algorithm based on a vocoder that offers harmonics reconstruction. The multiform units make it possible to reduce acoustic discontinuities at concatenation points and unnatural sound by preparing synthesis units with various lengths and various F/sub 0/ contours. The new speech modification algorithm, on the other hand, improves the quality of prosody modified speech. This algorithm is extremely effective in synthesizing speech whose prosodic parameters are quite different from those of synthesis units. Listening tests confirm that the new synthesis units yield speech with high intelligibility and naturalness, and that the new speech modification algorithm is superior to all other conventional vocoders and waveform domain algorithms including TD-PSOLA, especially when modifying the F/sub 0/ frequency upward.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing

Lead the way for us

Journal: IEEE Transactions on Speech and Audio Processing	Publication Date: Jan 1, 2001
Citations: 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Speech and Audio Processing