Abstract

Epoch extraction helps in speech enhancement and multispeaker separation from a speech. But it is a challenging task due to time-varying characteristics of the source and the system. Epoch sequence is useful to manipulate prosody in speech synthesis applications. Accurate estimation of epochs helps in characterizing voice quality features. This chapter aims at developing an extraction algorithm independent of the characteristics of vocal tract system. It improves the accuracy of epochs extracted and pitch detected from speech signal. For feature detection, we propose a robust framework derived from Hilbert–Huang transform of speech signal. The intrinsic mode functions (IMF) sharply identify instantaneous frequencies as function of time. The proposed technique guarantees accurate pitch estimation because of better decorrelating nature of HHT compared with DCT and DFTs. The results are simulated for an input speech signal taken from NOISEX-92 database. The simulated results show that the proposed algorithm outperforms the existing methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.