Abstract
In this chapter, two hybrid source modeling methods are proposed for improving the quality of HMM-based speech synthesis. The first method models the source using optimal pitch-synchronous residual frames that represent the excitation signals of phones. The second method proposes a hybrid source model capable of generating an excitation signal specific to each phone. For this second method, the phone-dependent characteristics of the excitation signal are analyzed first. In the proposed source model, the pitch-synchronous residual frames of a phone are then modeled as a sum of deterministic and noise components.
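The decomposition of residual frames into deterministic and noise components can be illustrated with a minimal sketch. The abstract does not say how the two components are estimated, so the sketch makes an assumption: the deterministic component is taken to be the sample-wise mean of a phone's length-normalized frames, and the noise component is whatever remains in each frame. The function name `decompose_frames` and the toy frame values are hypothetical and are not taken from the chapter.

```python
from typing import List, Tuple

Frame = List[float]


def decompose_frames(frames: List[Frame]) -> Tuple[Frame, List[Frame]]:
    """Split equal-length residual frames into a shared deterministic
    part and a per-frame noise part, so that frame = deterministic + noise."""
    if not frames:
        raise ValueError("need at least one frame")
    length = len(frames[0])
    if any(len(f) != length for f in frames):
        raise ValueError("frames must be length-normalized to the same size")

    # Assumption (not from the chapter): the deterministic component is
    # the sample-wise mean frame of the phone.
    deterministic = [
        sum(f[i] for f in frames) / len(frames) for i in range(length)
    ]
    # The noise component is the part of each frame the mean does not explain.
    noise = [
        [f[i] - deterministic[i] for i in range(length)] for f in frames
    ]
    return deterministic, noise


# Toy pitch-synchronous residual frames for one phone (made-up numbers).
frames = [
    [0.0, 1.0, -0.5, 0.2],
    [0.2, 0.8, -0.7, 0.0],
    [-0.2, 1.2, -0.3, 0.4],
]
deterministic, noise = decompose_frames(frames)
```

By construction, adding `deterministic` to any entry of `noise` reconstructs the corresponding input frame. A real system would also need pitch marking, frame length normalization, and a statistical model of the noise part, none of which is attempted here.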