Abstract

A real‐time text‐to‐speech system for English and Japanese has been developed. This system consists of a language processing module, a phonetic acoustic processing module, and a synthesis module. Full general English and Japanese sentences can be converted to speech. The Japanese software and English software are independent except for the synthesis module. The features of this system are as follows. (1) The synthesis module is a phoneme‐based cascade‐parallel formant synthesizer with high observed intelligibility (73.5% for the 119 Japanese monosyllables). (2) This system has a 3000‐morphene English dictionary and 40 000‐word Japanese dictionary with a high‐speed search algorithm. (3) A large speech database was collected for the development of Japanese prosody rules. (4) For the precise control of pitch contour, the Fujisaki model was adopted. (5) One of the two systems developed can stand alone; the other requires a personal computer with a high‐speed DSP board. (6) In the development of this system, some powerful interactive tools have also been developed for varying speech parameters in real time.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.