Abstract

In a waveform compilation (waveform concatenation or synthesis-by-rule) type speech synthesis method and speech synthesizer, phoneme waveform segments in natural speech waveforms are clustered, and one of the phoneme waveform segments having a parameter nearest the centroid of LPC parameters of all the phoneme waveforms in each cluster is selected and stored as a representative phoneme waveform in a waveform information memory. When synthesizing a speech waveform, representative phoneme waveforms of the same phonemes, whose context is most similar to that of each phoneme of a phoneme string of the speech to be synthesized, are selectively read out of the waveform information memory and thus read-out representative phoneme waveforms are sequentially concatenated for output as a continuous synthesized speech waveform.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call