A Japanese text‐to‐speech conversion system using pitch‐controlled residual wave excitation

Kazuhiko Iwata,Kazunori Ozawa,Takao Watanabe,Yukio Mitome

doi:10.1121/1.2027795

Abstract

A Japanese text‐to‐speech conversion system was developed that could generate fluent speech by concatenating CV and VC units. A new synthesis method was proposed, where residual waves were used as excitation signals for an LPC synthesis filter in all portions of each unit. LPC filter coefficients were calculated by approximating spectral envelopes extracted by the improved cepstral analysis method, which was less affected by pitch frequency than the conventional LPC method. Therefore, the synthetic speech generated by the proposed method had extremely high quality, even when pitch frequencies of the residual waves were widely changed. Moreover, in order to realize natural rhythms, phoneme duration was determined by new rules. The rules were created by taking into account the various contexts (such as preceding and following phonemes and the position of a phoneme in a phrase), and by statistically analyzing a large speech database. A comprehension test using 1000 phonetic balanced words showed that the proposed system achieved 97.4% of high accuracy rate.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Japanese text‐to‐speech conversion system using pitch‐controlled residual wave excitation

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

A perceptual experiment on voice individuality by altering pitch and formant frequencies
Hisao Kuwabara
The Journal of the Acoustical Society of America | VOL. 100
Hisao KuwabaraHisao Kuwabara
01 Oct 1996
The Journal of the Acoustical Society of America | VOL. 100

Method of controlling high-speed reading in a text-to-speech conversion system
Keiichi Chihara
The Journal of the Acoustical Society of America | VOL. 127
Keiichi ChiharaKeiichi Chihara
01 Jan 2009
The Journal of the Acoustical Society of America | VOL. 127

Comprehensive testing for, and diagnosis of, sexually transmissible infections among Australian gay and bisexual men: findings from repeated, cross-sectional behavioural surveillance, 2003–2012
Martin Holt ... Chris Bourne
Sexually Transmitted Infections | VOL. 90
Martin Holt, et. al.Martin Holt ... Chris Bourne
14 Nov 2013
Sexually Transmitted Infections | VOL. 90

Two-Stage Gender Identification Using Pitch Frequencies, MFCCs and HMMs
Rong Phoophuangpairoj ... Sukanya Phongsuphap
-
Rong Phoophuangpairoj, et. al.Rong Phoophuangpairoj ... Sukanya Phongsuphap
01 Oct 2015
01 Oct 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Japanese text‐to‐speech conversion system using pitch‐controlled residual wave excitation

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America