Abstract

We propose a robust formant extraction algorithm that combines the spectral peak picking, formants location examining for peak merger checking, and the root extraction methods. The spectral peak picking method is employed to locate the formant candidates, and the root extraction is used for solving the peak merger problem. The location and the distance between the extracted formants are also utilized to efficiently find out suspected peak mergers. The proposed algorithm does not require much computation, and is shown to be superior to previous formant extraction algorithms through extensive tests using TIMIT speech database.

Highlights

  • The formant is one of the most important features in speech signals,and is used for many applications, such as speech recognition, speech characterization, and synthesis

  • The spectral peak picking methods and their variants have been widely used for a long time because of low computational complexity, but they often seriously suffer from the peak merger problems [1,2,3], where two adjoining formants are identified into a single one

  • The root extraction methods try to find out all the locations of roots by solving a prediction-error polynomial obtained from linear prediction coefficients (LPC), which obviously requires much computation [5]

Read more

Summary

Introduction

The formant is one of the most important features in speech signals,and is used for many applications, such as speech recognition, speech characterization, and synthesis. Previous formant extraction methods can largely be classified into spectral peak picking, root extraction, and analysis by synthesis [1,2,3,4]. The spectral peak picking methods and their variants have been widely used for a long time because of low computational complexity, but they often seriously suffer from the peak merger problems [1,2,3], where two adjoining formants are identified into a single one. The root extraction methods try to find out all the locations of roots by solving a prediction-error polynomial obtained from linear prediction coefficients (LPC), which obviously requires much computation [5]. The accuracy of the root extraction methods can hardly be high because it is not always clear to determine whether a root obtained forms a formant or just shapes the spectrum [5]

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.