HMM-Based Style Control for Expressive Speech Synthesis with Arbitrary Speaker's Voice Using Model Adaptation

Takashi Nose, Takao Kobayashi, Makoto Tachibana

doi:10.1587/transinf.e92.d.489

Open Access

Abstract

This paper presents methods for controlling the intensity of emotional expressions and speaking styles in the synthetic speech of an arbitrary speaker, using only a small amount of that speaker's speech data, within HMM-based speech synthesis. Model adaptation approaches are introduced into the style control technique based on the multiple-regression hidden semi-Markov model (MRHSMM). Two approaches are proposed for training a target speaker's MRHSMM. The first is MRHSMM-based model adaptation, in which a pretrained MRHSMM is adapted to the target speaker; for this purpose, we formulate the MLLR adaptation algorithm for the MRHSMM. The second uses simultaneous adaptation of speaker and style from an average voice model to obtain the target speaker's style-dependent HSMMs, which are then used to initialize the MRHSMM. Subjective evaluation with 50 adaptation sentences per style shows that the proposed methods outperform conventional speaker-dependent model training when the same amount of target-speaker speech data is used.
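The style control idea can be illustrated with a small sketch. It assumes the usual multiple-regression formulation, in which a state's mean vector is a linear function of a low-dimensional style vector, and it uses a generic affine (MLLR-style) transform of a mean vector as a stand-in for adaptation. The function names, matrix values, and dimensions below are hypothetical and are not taken from the paper; the paper's own MLLR formulation for the MRHSMM is not reproduced here.

```python
def style_mean(H, style):
    """Mean vector as a multiple regression of the style vector.

    Assumed form: mu = H @ [1, v_1, ..., v_L], where H has L + 1 columns.
    """
    xi = [1.0] + [float(s) for s in style]  # augmented style vector
    return [sum(h * x for h, x in zip(row, xi)) for row in H]


def affine_transform(A, b, mu):
    """Generic MLLR-style affine transform of a mean vector: A @ mu + b."""
    return [sum(a * m for a, m in zip(row, mu)) + bias
            for row, bias in zip(A, b)]


# Hypothetical toy regression matrix: 2-dim feature, 2-dim style space.
H = [[1.0, 0.5, -0.2],
     [0.0, 0.3, 0.4]]

neutral = style_mean(H, [0.0, 0.0])      # style vector at the origin
emphasized = style_mean(H, [1.5, 0.0])   # first style axis pushed past 1.0

# Hypothetical speaker transform: identity rotation plus a bias shift.
A = [[1.0, 0.0],
     [0.0, 1.0]]
b = [0.1, -0.1]
adapted = affine_transform(A, b, emphasized)
```

In this sketch, scaling the style vector changes how far the mean moves from its neutral value, which is the sense in which intensity is controlled; the affine transform only shows the general shape of a linear-regression adaptation step.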

Journal: IEICE Transactions on Information and Systems
Publication Date: Jan 1, 2009
Citations: 41
License type: free

Similar Papers

Speaker-independent style conversion for HMM-based expressive speech synthesis
Hiroki Kanagawa ... Takashi Nose
01 May 2013

Improving the performance of HMM-based voice conversion using context clustering decision tree and appropriate regression matrix format
Long Qin ... Zhen-Hua Ling
17 Sep 2006

Unsupervised Speaker Adaptation for DNN-based Speech Synthesis using Input Codes
Shinji Takaki ... Junichi Yamagishi
01 Nov 2018

Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis
Makoto Tachibana ... Takao Kobayashi
01 Mar 2008
