Abstract

A new multiscale voice morphing algorithm using radial basis function (RBF) analysis is presented in this paper. The approach copes well with small training sets of high dimension, which is a problem often encountered in voice morphing. The aim of this algorithm is to transform one person’s speech pattern so that it is perceived as if it was spoken by another speaker. The voice morphing system we propose assumes parallel training data from source and target speakers and uses the theory of wavelets in order to extract speaker feature information. The spectral conversion is modelled using RBF analysis. Independent listener tests demonstrate effective transformation of the perceived speaker identity.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call