Abstract

Multiple emotional voice conversion in Vietnamese HMM-based speech synthesis using non-negative matrix factorization

Highlights

  • In many practical applications, TTS with multiple synthesized emotional voices is required while the requirement of having huge amounts data of emotional target voices for training is usually not available

  • State-of-the-art voice conversion (VC) still cannot synthesize target speech while keeping the detail information related to speaker emotions of the target voice

  • We proposed to use the exemplar-based VC using non-negative matrix factorization combined with Hidden Markov Model (HMM)-based TTS to synthesize multiple emotional voices that can keep the detail information related to speaker emotions

Read more

Summary

Introduction

TTS with multiple synthesized emotional voices is required while the requirement of having huge amounts data of emotional target voices for training is usually not available. In this approach, synthesized neutral speech is adapted to target emotional voices with a few amounts of emotional target data. In both HMM-based synthesis and voice adaption, the structures of the estimated spectrum correspond to the average of different speech spectra in the training database due to the use of the mean vector. Using a VC method as a post-processing step for HMM-based TTS is another approach to synthesize multiple emotional target voices. We proposed to use the exemplar-based VC using non-negative matrix factorization combined with HMM-based TTS to synthesize multiple emotional voices that can keep the detail information related to speaker emotions.

Emotions in speech signal
Using non-negative matrix factorization for emotional voice conversion
Exemplar-based emotional voice conversion
Combination between HMM-based TTS and exemplar-based VC
Data corpus
Experimental conditions
Objective measures
Subjective measures
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.