Abstract
We propose a robust formant extraction algorithm that combines spectral peak picking, a formant-location check for detecting peak mergers, and root extraction. Spectral peak picking locates the formant candidates, and root extraction resolves peak mergers. The locations of the extracted formants and the distances between them are used to identify suspected peak mergers efficiently. The proposed algorithm requires little computation and is shown to outperform previous formant extraction algorithms in extensive tests on the TIMIT speech database.
Summary
Formants are among the most important features of speech signals and are used in many applications, such as speech recognition, speech characterization, and synthesis. Previous formant extraction methods can largely be classified into spectral peak picking, root extraction, and analysis by synthesis [1,2,3,4]. Spectral peak picking methods and their variants have long been widely used because of their low computational complexity, but they often suffer seriously from the peak merger problem [1,2,3], in which two adjacent formants are identified as a single one. Root extraction methods find all the roots of the prediction-error polynomial obtained from the linear prediction coefficients (LPC), which requires considerable computation [5]. Their accuracy is also limited, because it is not always clear whether a given root corresponds to a formant or merely shapes the spectrum [5].
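The peak merger problem and the contrast with root extraction can be illustrated with a small synthetic example. The sketch below is not the algorithm proposed in the paper; it assumes an all-pole model built directly from chosen formant frequencies and bandwidths (rather than LPC analysis of real speech), and the function names, sampling rate, and formant values are hypothetical choices made for illustration. Two resonances at 900 Hz and 1100 Hz are given bandwidths wider than their spacing, so they fuse into one spectral peak, while the polynomial roots still separate them.

```python
import numpy as np

def allpole_from_formants(freqs_hz, bws_hz, fs):
    """Build the coefficients of A(z) from formant frequencies and bandwidths.

    Each formant contributes a conjugate pole pair with radius
    r = exp(-pi * bw / fs) and angle theta = 2 * pi * f / fs.
    """
    a = np.array([1.0])
    for f, bw in zip(freqs_hz, bws_hz):
        r = np.exp(-np.pi * bw / fs)
        theta = 2.0 * np.pi * f / fs
        a = np.convolve(a, [1.0, -2.0 * r * np.cos(theta), r * r])
    return a

def peak_picking(a, fs, n_fft=1024):
    """Return frequencies (Hz) of local maxima of the spectrum 1/|A(e^jw)|."""
    log_spectrum = -20.0 * np.log10(np.abs(np.fft.rfft(a, n_fft)))
    mid = log_spectrum[1:-1]
    is_peak = (mid > log_spectrum[:-2]) & (mid > log_spectrum[2:])
    bins = np.nonzero(is_peak)[0] + 1  # +1 because mid starts at bin 1
    return bins * fs / n_fft

def root_extraction(a, fs):
    """Return (frequencies, bandwidths) in Hz from the roots of A(z)."""
    roots = np.roots(a)
    roots = roots[np.imag(roots) > 0]  # keep one root per conjugate pair
    freqs = np.angle(roots) * fs / (2.0 * np.pi)
    bws = -np.log(np.abs(roots)) * fs / np.pi
    order = np.argsort(freqs)
    return freqs[order], bws[order]

# Hypothetical test case: two close, wide resonances plus one narrow one.
fs = 8000
a = allpole_from_formants([900, 1100, 2500], [400, 400, 100], fs)

peaks = peak_picking(a, fs)
root_freqs, root_bws = root_extraction(a, fs)

print("peak picking:   ", np.round(peaks, 1))
print("root extraction:", np.round(root_freqs, 1))
print("root bandwidths:", np.round(root_bws, 1))
```

In this setup, peak picking reports fewer candidates than there are resonances, whereas root extraction recovers all three pole frequencies at the cost of solving the polynomial. The wide bandwidths returned for the two close roots also hint at why deciding whether a root is a true formant or merely shapes the spectrum is not always straightforward.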