Voice conversion using General Regression Neural Network

Jagannath Nirmal, Mukesh Zaveri, Suprava Patnaik, Pramod Kachare

doi:10.1016/j.asoc.2014.06.040

Abstract

The objective of a voice conversion system is to formulate a mapping function that transforms the source speaker's characteristics into those of the target speaker. In this paper, we propose a General Regression Neural Network (GRNN) based model for voice conversion. GRNN is a single-pass learning network, which makes the training procedure fast. The proposed system uses the shape of the vocal tract, the shape of the glottal pulse (excitation signal), and long-term prosodic features to carry out the voice conversion task. The shape of the vocal tract and the shape of the source excitation of a particular speaker are represented using Line Spectral Frequencies (LSFs) and the Linear Prediction (LP) residual, respectively. GRNN is used to obtain the mapping function between the source and target speakers. Direct transformation of the time-domain residual using an Artificial Neural Network (ANN) causes phase changes and generates artifacts in consecutive frames. To alleviate this, wavelet packet decomposed coefficients are used to characterize the excitation of the speech signal. The long-term prosodic parameters, namely the pitch contour (intonation) and the energy profile of the test signal, are also modified in relation to those of the target (desired) speaker using the baseline method. The performance of the proposed model is compared with voice conversion systems based on the state-of-the-art RBF and GMM models using objective and subjective evaluation measures. The evaluation measures show that the proposed GRNN-based voice conversion system performs slightly better than the state-of-the-art models.
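To make the "single-pass learning" idea concrete, here is a minimal sketch of a GRNN in its general textbook form: a kernel-weighted average of stored target vectors. It is not the authors' implementation. The class name `GRNN`, the smoothing parameter `sigma`, and the toy one-dimensional frames are assumptions for illustration only; the abstract does not state the feature dimensions or the smoothing value used in the paper.

```python
import math


class GRNN:
    """Illustrative general regression neural network (kernel-weighted average)."""

    def __init__(self, sigma=0.1):
        # Smoothing parameter; an assumed value, normally tuned per dataset.
        self.sigma = sigma
        self.patterns = []
        self.targets = []

    def fit(self, source_frames, target_frames):
        # Single-pass "training": store each aligned source/target pair.
        self.patterns = [list(x) for x in source_frames]
        self.targets = [list(y) for y in target_frames]
        return self

    def predict(self, x):
        # Pattern layer: squared Euclidean distance to every stored source frame.
        d2 = [sum((a - b) ** 2 for a, b in zip(x, p)) for p in self.patterns]
        # Subtracting the minimum distance cancels in the ratio below and
        # keeps at least one kernel weight equal to 1 (numerical stability).
        d2_min = min(d2)
        weights = [math.exp(-(d - d2_min) / (2 * self.sigma ** 2)) for d in d2]
        total = sum(weights)
        # Summation and output layers: normalized weighted average of targets.
        dim = len(self.targets[0])
        return [
            sum(w * t[k] for w, t in zip(weights, self.targets)) / total
            for k in range(dim)
        ]


# Toy usage with hypothetical 1-D "frames" standing in for LSF vectors.
model = GRNN(sigma=0.1).fit([[0.0], [1.0]], [[0.0], [2.0]])
midpoint = model.predict([0.5])  # equidistant from both stored patterns
```

Because nothing is iteratively optimized, fitting costs only the time needed to store the aligned frame pairs; the trade-off is that every prediction visits all stored patterns.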


Journal: Applied Soft Computing
Publication Date: Jul 11, 2014
Citations: 31

Similar Papers

Comparing ANN and GMM in a voice conversion framework
R.H Laskar ... K Banerjee
Applied Soft Computing Journal | VOL. 12
05 Jul 2012

Voice conversion system using SVM for vocal tract modification and codebook based model for pitch contour modification
R H Laskar ... F A Talukdar
01 Nov 2008

Novel approach of MFCC based alignment and WD-residual modification for voice conversion using RBF
Jagannath Nirmal ... Pramod Kachare
Neurocomputing | VOL. 237
27 Aug 2016

Voice Conversion by Mapping the Spectral and Prosodic Features Using Support Vector Machine
Rabul Hussain Laskar ... Saugat Das
01 Jan 2009
