Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis

Yi-Chin Huang,Chung-Hsien Wu,Sz-Ting Weng

doi:10.1109/iscslp.2012.6423536

Abstract

In this paper, a novel hierarchical prosodic unit selection method is proposed based on pitch contour pattern retrieval, in order to obtained natural pitch contour of the personalized synthetic voice. In this framework, a hierarchical prosodic unit based on Fujisaki model is used to take local pitch contour variation and global intonation of utterance into account. Furthermore, novel ways of integrating pitch contour pattern of prosodic units in the prosodic model are invents in order to improve the selection mechanism of the appropriate pitch contour. A novel prosodic unit selection method is proposed based on sentence retrieval, which not only uses the traditional linguistic cue as selection criterion, but also the shape of the pitch contour. Also, the codewords of pitch patterns in the training corpus and synthesized corpus were constructed by the proposed method and were used to map the relation between training codeword and synthesized corpus. Finally, the language model of pitch pattern is adopted to find the proper pitch pattern sequence of input text. The evaluation results demonstrate that the proposed prosodic model substantially improves naturalness of the intonation of the synthesized speech compared to that of model-based method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Personalized natural speech synthesis based on retrieval of pitch patterns using hierarchical Fujisaki model
Yi-Chin Huang ... Shih-Lun Lin
-
Yi-Chin Huang, et. al.Yi-Chin Huang ... Shih-Lun Lin
01 May 2013
01 May 2013

An Iterated Two-Step Sinusoidal Pitch Contour Formulation for Expressive Speech Synthesis
Izzad Ramli ... Nursuriati Jamil
Journal of Information and Communication Technology | VOL. 20
Izzad Ramli, et. al.Izzad Ramli ... Nursuriati Jamil
01 Jan 2020
Journal of Information and Communication Technology | VOL. 20

중국인 학습자들의 한국어 낭독 문장 피치곡선의 변동 양상
Youngsook Yune
-
Youngsook YuneYoungsook Yune
30 Jun 2013
30 Jun 2013

Statistical Pitch Conversion Approaches Based on Korean Accentual Phrases
Ki Young Lee ... Myung Jin Bae
-
Ki Young Lee, et. al.Ki Young Lee ... Myung Jin Bae
01 Jan 2004
01 Jan 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical prosodic pattern selection based on Fujisaki model for natural mandarin speech synthesis

Abstract

Talk to us

Similar Papers