TEXT-INDEPENDENT VOICE CONVERSION BASED ON CHINESE PHONEME CLASSIFICATION AND KERNEL EIGENVOICES GAUSSIAN MIXTURE MODEL

Yanping Li,Hui Ding,Linghua Zhang

doi:10.1142/s0219878911002537

Abstract

This paper proposed a novel algorithm for text-independent voice conversion based on Chinese phoneme classification and kernel eigenvoices Gaussian mixture model. The phoneme classification can avoid the disturbance of linguistic information and spectral smoothing. A speaker adaptation technique of kernel eigenvoices was employed for performing spectral conversion between speakers for each category phoneme, adapting the conversion parameters derived for the pre-stored pairs of speakers to a desired pair, which can relax the parallel constraint effectively. Objective test on the spectral conversion accuracy demonstrated that the proposed kernel algorithm can effectively exploit the nonlinearity in supervector space. In subjective listening test, an ABX test was performed and the proposed algorithm was preferred to the existing eigenvoice algorithm by 4.75%, and improved quality by 10.91% in terms of mean opinion score (MOS). Both objective and subjective tests demonstrated that the proposed algorithm effectively enhanced speech quality and speaker individuality in a text-independent manner.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TEXT-INDEPENDENT VOICE CONVERSION BASED ON CHINESE PHONEME CLASSIFICATION AND KERNEL EIGENVOICES GAUSSIAN MIXTURE MODEL

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Acquisition

Lead the way for us

Similar Papers

Nonparallel voice conversion based on phoneme classification and eigenvoices
Yan-Ping Li ... Ding Hui
-
Yan-Ping Li, et. al.Yan-Ping Li ... Ding Hui
01 Nov 2010
01 Nov 2010

An algorithm for Chinese Voice conversion based on phonetic Gaussian mixture model
Yanping Li ... Linghua Zhang
-
Yanping Li, et. al.Yanping Li ... Linghua Zhang
01 Oct 2010
01 Oct 2010

Robust processing techniques for voice conversion
Oytun Turk ... Levent M Arslan
Computer Speech & Language | VOL. 20
Oytun Turk, et. al.Oytun Turk ... Levent M Arslan
12 Jul 2005
Computer Speech & Language | VOL. 20

QoE-Driven Integrated Heterogeneous Traffic Resource Allocation Based on Cooperative Learning for 5G Cognitive Radio Networks
Fatemeh Shah Mohammadi ... Andres Kwasinski
-
Fatemeh Shah Mohammadi, et. al.Fatemeh Shah Mohammadi ... Andres Kwasinski
01 Jul 2018
01 Jul 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TEXT-INDEPENDENT VOICE CONVERSION BASED ON CHINESE PHONEME CLASSIFICATION AND KERNEL EIGENVOICES GAUSSIAN MIXTURE MODEL

Abstract

Talk to us

Similar Papers

More From: International Journal of Information Acquisition