Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling

D Erro,I Hernaez,E Navas

doi:10.1109/tasl.2012.2227735

Abstract

Voice conversion methods based on frequency warping followed by amplitude scaling have been recently proposed. These methods modify the frequency axis of the source spectrum in such manner that some significant parts of it, usually the formants, are moved towards their image in the target speaker's spectrum. Amplitude scaling is then applied to compensate for the differences between warped source spectra and target spectra. This article presents a fully parametric formulation of a frequency warping plus amplitude scaling method in which bilinear frequency warping functions are used. Introducing this constraint allows for the conversion error to be described in the cepstral domain and to minimize it with respect to the parameters of the transformation through an iterative algorithm, even when multiple overlapping conversion classes are considered. The paper explores the advantages and limitations of this approach when applied to a cepstral representation of speech. We show that it achieves significant improvements in quality with respect to traditional methods based on Gaussian mixture models, with no loss in average conversion accuracy. Despite its relative simplicity, it achieves similar performance scores to state-of-the-art statistical methods involving dynamic features and global variance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: Mar 1, 2013
Citations: 73

Similar Papers

Novel Amplitude Scaling method for bilinear frequency Warping-based Voice Conversion
Nirmesh J Shah ... Hemant A Patil
-
Nirmesh J Shah, et. al.Nirmesh J Shah ... Hemant A Patil
01 Mar 2017
01 Mar 2017

Voice conversion by combining frequency warping with unit selection
Zhiwei Shuang ... Fanping Meng
-
Zhiwei Shuang, et. al. Zhiwei Shuang ... Fanping Meng
01 Mar 2008
01 Mar 2008

An Exemplar-Based Approach to Frequency Warping for Voice Conversion
Xiaohai Tian ... Siu Wa Lee
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 25
Xiaohai Tian, et. al.Xiaohai Tian ... Siu Wa Lee
01 Oct 2017
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 25

Piecewise linear definition of transformation functions for speaker de-identification
Carmen Magarinos ... Paula Lopez-Otero
-
Carmen Magarinos, et. al.Carmen Magarinos ... Paula Lopez-Otero
01 Jul 2016
01 Jul 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parametric Voice Conversion Based on Bilinear Frequency Warping Plus Amplitude Scaling

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing