Modulation spectrum-based post-filter for GMM-based Voice Conversion

Shinnosuke Takamichi, Satoshi Nakamura, Alan W Black, Tomoki Toda

doi:10.1109/apsipa.2014.7041540

Abstract

This paper addresses the over-smoothing effect in Gaussian Mixture Model (GMM)-based Voice Conversion (VC). The flexibility of the statistical approach is one of the major reasons why it is widely applied to speech-based systems. However, quality degradation caused by over-smoothed converted speech parameters is an unavoidable problem of statistical modeling. One common approach to this over-smoothing in the conversion step is to compensate features of the generated parameters, such as Global Variance (GV), that explicitly express the over-smoothing effect. For statistical Text-To-Speech (TTS) synthesis, we recently introduced the Modulation Spectrum (MS), an extended form of GV, and proposed an MS-based Post-Filter (MSPF) for Hidden Markov Model (HMM)-based TTS synthesis. In this paper, we apply the MSPF to GMM-based VC. Because the MS of speech parameters is degraded by the GMM-based conversion process, we apply a post-filter that modifies the MS of the converted parameters. The experimental evaluation shows that the proposed post-filter improves speech quality.
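The abstract does not give the filter equations, so the following is only a minimal sketch of one plausible reading. It assumes that the MS of a parameter trajectory is its log power spectrum along the time axis, and that the post-filter shifts and scales each MS bin from converted-speech statistics toward natural-speech statistics while keeping the original phase. The function names, the interpolation coefficient `k`, and the per-bin statistics arguments are hypothetical; in practice the statistics would be estimated from training data, and an FFT library would replace the naive DFT used here to stay dependency-free.

```python
import cmath
import math

EPS = 1e-12  # floor that keeps log() finite for empty bins


def half_dft(x, n_fft):
    """Naive zero-padded DFT of a real sequence; returns bins 0..n_fft//2."""
    if n_fft < len(x):
        raise ValueError("n_fft must be at least len(x)")
    padded = list(x) + [0.0] * (n_fft - len(x))
    return [
        sum(v * cmath.exp(-2j * math.pi * f * t / n_fft)
            for t, v in enumerate(padded))
        for f in range(n_fft // 2 + 1)
    ]


def half_idft(half, n_fft, length):
    """Inverse of half_dft for a real signal, truncated to `length` frames."""
    out = []
    for t in range(length):
        acc = 0.0
        for f, c in enumerate(half):
            # Bins 0 and n_fft/2 have no mirrored partner; the rest count twice.
            weight = 1.0 if f == 0 or 2 * f == n_fft else 2.0
            acc += weight * (c * cmath.exp(2j * math.pi * f * t / n_fft)).real
        out.append(acc / n_fft)
    return out


def log_modulation_spectrum(x, n_fft):
    """Assumed MS definition: log power spectrum of one feature trajectory."""
    return [math.log(abs(c) ** 2 + EPS) for c in half_dft(x, n_fft)]


def ms_postfilter(x, mean_nat, std_nat, mean_conv, std_conv, k, n_fft):
    """Move the log MS of x from converted-speech toward natural-speech stats.

    k = 0 leaves x unchanged; k = 1 applies the full normalization.
    All statistics are per-bin lists of length n_fft // 2 + 1.
    """
    filtered = []
    for f, c in enumerate(half_dft(x, n_fft)):
        s = math.log(abs(c) ** 2 + EPS)
        # Standardize with converted stats, then rescale with natural stats.
        s_nat = (s - mean_conv[f]) / std_conv[f] * std_nat[f] + mean_nat[f]
        s_new = (1.0 - k) * s + k * s_nat
        # Rebuild the complex bin: new magnitude, original phase.
        filtered.append(cmath.rect(math.exp(0.5 * s_new), cmath.phase(c)))
    return half_idft(filtered, n_fft, len(x))
```

Under these assumptions, the filter would be run independently on each feature dimension of a converted utterance, for example `ms_postfilter(trajectory, mean_nat, std_nat, mean_conv, std_conv, k=0.8, n_fft=1024)`, where the value of `k` and the DFT length are illustrative choices rather than settings reported in the paper.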
