Including dynamic and phonetic information in voice conversion systems

Antonio Bonafonte,Helenca Duxans,Jan Van Santen,Alexander Kain

doi:10.21437/interspeech.2004-444

Including dynamic and phonetic information in voice conversion systems

Antonio Bonafonte, Helenca Duxans + Show 2 more

https://doi.org/10.21437/interspeech.2004-444

Copy DOI

Publication Date: Oct 4, 2004

Citations: 23

Affiliation: Universitat Politècnica de Catalunya

#Voice Conversion Systems #Dynamic Information + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Voice Conversion (VC) systems modify a speaker voice (source speaker) to be perceived as if another speaker (target speaker) had uttered it. Previous published VC approaches using Gaussian Mixture Models [1] performs the conversion in a frame-by-frame basis using only spectral information. In this paper, two new approaches are studied in order to extend the GMM-based VC systems. First, dynamic information is used to build the speaker acoustic model. So, the transformation is carried out according to sequences of frames. Then, phonetic information is introduced in the training of the VC system. Objective and perceptual results compare the performance of the proposed systems.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.