Abstract

The accuracy of speech recognition systems, to a large extent, depend on the feature sets used for representing the speech data. It has been a continuous process to develop feature sets to perform more accurate speech recognition through ASR (Automatic Speech Recognition) systems. Many feature sets and their different combinations have been tried to achieve better accuracy but the feature set providing completely accurate representation is yet to be formulated. This paper investigates the generation of reduced sets of MFCCs for vowel recognition. The study is more focused on the generation and behavior of MFCCs for different vowel sounds. The goal is to identify the features that enhance their discriminating capabilities and improve the performance of ASR systems for particular sounds. The results of the analysis show that the proposed reduced features perform well and can be further used to improve the accuracy of ASR systems, specifically the ones in resource constrained devices.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.