Frequency Warping for Speaker Adaptation in HMM-based Speech Synthesis

Wensheng Gao, Qiying Cao

doi:10.6688/jise.2014.30.4.13

Abstract

Speaker adaptation in speech synthesis transforms a source utterance into a target utterance that differs from the source in voice characteristics. In this paper, we apply vocal tract length normalization, which is generally used in speech recognition to remove individual speaker characteristics, to speaker adaptation in speech synthesis. We propose a frequency warping approach based on a time-varying bilinear function to reduce the weighted spectral distance between the source speaker and the target speaker. The warped spectra of the source speaker are then converted to line spectrum pairs to train hidden Markov models (HMMs). The HMMs are further adapted with the target speaker's data by algorithms based on maximum likelihood linear regression. The experimental results show that our frequency warping approach brings the warped spectra of the source speaker closer to those of the target speaker, and that the resulting adapted HMMs outperform HMMs trained on unwarped spectra in terms of synthesized speech naturalness and speaker similarity.
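
As an illustration of the bilinear frequency warping mentioned in the abstract, the following is a minimal sketch that assumes the standard first-order all-pass warping function commonly used for vocal tract length normalization. The abstract does not give the paper's exact parameterization, its spectral distance weighting, or how the per-frame warping factor is estimated, so the function names (`bilinear_warp`, `warp_frame`, `warp_utterance`), the uniform frequency grid, and the linear interpolation are all assumptions made for this example.

```python
import math


def bilinear_warp(omega: float, alpha: float) -> float:
    """Map a normalized angular frequency omega in [0, pi] through the
    first-order all-pass (bilinear) function with warping factor |alpha| < 1.
    Assumed standard form; not taken from the paper itself."""
    return omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                    1.0 - alpha * math.cos(omega))


def warp_frame(spectrum: list[float], alpha: float) -> list[float]:
    """Resample one magnitude spectrum (uniform samples on [0, pi], at least
    two bins) so that energy at omega moves to bilinear_warp(omega, alpha)."""
    n = len(spectrum)
    warped = []
    for k in range(n):
        target = math.pi * k / (n - 1)
        # The inverse of the bilinear map with alpha is the same map with -alpha.
        source = bilinear_warp(target, -alpha)
        pos = min(max(source / math.pi * (n - 1), 0.0), float(n - 1))
        lo = min(int(pos), n - 2)
        frac = pos - lo
        # Linear interpolation between the two neighbouring source bins.
        warped.append((1.0 - frac) * spectrum[lo] + frac * spectrum[lo + 1])
    return warped


def warp_utterance(frames: list[list[float]], alphas: list[float]) -> list[list[float]]:
    """Time-varying warping: one (hypothetical) warping factor per frame."""
    return [warp_frame(frame, alpha) for frame, alpha in zip(frames, alphas)]
```

With `alpha = 0` a frame is returned unchanged; under this sign convention, positive values shift spectral content toward higher frequencies and negative values toward lower ones. In a full system the per-frame factors would be chosen to minimize a spectral distance to the target speaker, which this sketch leaves out.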

Journal: Journal of Information Science and Engineering
Publication Date: Jul 1, 2014
Citations: 28

Similar Papers

Frequency warping for speaker adaption of text-to-speech synthesis
Weixun Gao ... Qiying Cao
01 Jan 2009

Speaker and style adaptation using average voice model for style control in HMM-based speech synthesis
Makoto Tachibana ... Takao Kobayashi
01 Mar 2008

Modeling of Speech Parameter Sequence Considering Global Variance for HMM-Based Speech Synthesis
Tomoki Toda
19 Apr 2011

An HMM-Based Approach to Flexible Speech Synthesis
Keiichi Tokuda
01 Jan 2006
