DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

M Kiran Reddy,K Sreenivasa Rao

doi:10.1007/s11063-019-10149-y

Abstract

Cross-lingual voice conversion (CLVC) is a quite challenging task since the source and target speakers speak different languages. This paper proposes a CLVC framework based on bottleneck features and deep neural network (DNN). In the proposed method, the bottleneck features extracted from a deep auto-encoder (DAE) are used to represent speaker-independent features of speech signals from different languages. A DNN model is trained to learn the mapping between bottleneck features and the corresponding spectral features of the target speaker. The proposed method can capture speaker-specific characteristics of a target speaker, and hence requires no speech data from source speaker during training. The performance of the proposed method is evaluated using data from three Indian languages: Telugu, Tamil and Malayalam. The experimental results show that the proposed method outperforms the baseline Gaussian mixture model (GMM)-based CLVC approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

Abstract

Talk to us

Similar Papers

More From: Neural Processing Letters

Lead the way for us

Journal: Neural Processing Letters	Publication Date: Nov 7, 2019
Citations: 1

Similar Papers

A Multi-level GMM-Based Cross-Lingual Voice Conversion Using Language-Specific Mixture Weights for Polyglot Synthesis
B Ramani ... M P Actlin Jeeva
Circuits, Systems, and Signal Processing | VOL. 35
B Ramani, et. al.B Ramani ... M P Actlin Jeeva
10 Jul 2015
Circuits, Systems, and Signal Processing | VOL. 35

An Approach to Cross-Lingual Voice Conversion
Sai Sirisha Rallabandi ... Suryakanth V Gangashetty
-
Sai Sirisha Rallabandi, et. al.Sai Sirisha Rallabandi ... Suryakanth V Gangashetty
01 Jul 2019
01 Jul 2019

On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion
Berrak Sisman ... Haizhou Li
-
Berrak Sisman, et. al.Berrak Sisman ... Haizhou Li
01 Dec 2019
01 Dec 2019

Investigation of different acoustic modeling techniques for low resource Indian language data
Sriranjani R ... Murali Karthick B
-
Sriranjani R, et. al. Sriranjani R ... Murali Karthick B
01 Feb 2015
01 Feb 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DNN-Based Cross-Lingual Voice Conversion Using Bottleneck Features

Abstract

Talk to us

Similar Papers

More From: Neural Processing Letters