Abstract

This paper proposes a simple and efficient variance compensation technique to improve the perceptual quality of synthetic speech in parametric speech synthesis. First, we analyze the problem of spectral and F0 enhancement with global variance (GV) in HMM-based speech synthesis. In conventional GV-based parameter generation, the enhancement is achieved by taking into account a GV probability density function with fixed GV model parameters for every output utterance during the speech parameter generation process. We find that using fixed GV parameters produces much smaller variation of GVs across synthesized utterances than is observed in natural speech. In addition, the computational cost is high because of iterative optimization. This paper examines these issues using multiple objective measures, including variance characteristics, GV distortions, and GV correlations. We propose a simple and fast compensation method based on a global affine transformation that yields a GV distribution closer to that of natural speech and improves the correlation of GVs between natural and generated parameter sequences. Experimental results demonstrate that the proposed variance compensation method outperforms conventional GV-based parameter generation in objective and subjective similarity to natural speech while maintaining naturalness.
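The core idea of affine variance compensation can be illustrated with a minimal sketch. The function below is a hypothetical illustration, not the authors' exact method: it applies a per-dimension affine transform to a generated parameter trajectory so that its global variance moves toward a target GV (e.g., one estimated from natural speech). The function name, the `alpha` interpolation weight, and the target-GV source are assumptions for illustration.

```python
import numpy as np

def compensate_variance(traj, gv_target, alpha=1.0):
    """Hypothetical sketch of GV compensation via a global affine transform.

    traj:      (T, D) generated parameter trajectory (T frames, D dimensions)
    gv_target: (D,) target global variance per dimension
    alpha:     interpolation weight between generated and target GV
    """
    mean = traj.mean(axis=0)
    gv_gen = traj.var(axis=0)
    # Scale factor that maps the generated variance onto the
    # interpolated target variance; the transform is affine in each
    # dimension: y_t = scale * (x_t - mean) + mean.
    target = alpha * gv_target + (1.0 - alpha) * gv_gen
    scale = np.sqrt(target / np.maximum(gv_gen, 1e-12))
    return (traj - mean) * scale + mean
```

Because the transform is a closed-form affine map rather than an iterative optimization, it is fast and preserves the trajectory mean while exactly matching the interpolated target variance in each dimension.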
