Abstract

Speech recognition is a fascinating application of digital signal processing offering unparalleled opportunities. In this paper, a comparative study of different feature extraction techniques like Linear Predictive Coding (LPC), Discrete Wavelet Transforms (DWT) and Wavelet packet Decomposition (WPD) are employed for recognizing speaker independent spoken isolated words. Voice signals are sampled directly from the microphone and then they are processed using these three techniques for extracting the features. Words from Malayalam, one of the four major Dravidian languages of southern India are chosen for recognition. Training, testing and pattern recognition are performed using Artificial Neural Networks (ANN). This work includes three speech recognition methods. First one is a hybrid approach with LPC and ANN, second method uses a combination of DWT and ANN and the third one utilizes a combination of WPD and ANN. Back propagation method is used to train the ANN. The proposed method is implemented for 50 speakers uttering 20 isolated words each. All the three methods produce good recognition accuracy. LPC based method produced an accuracy of 81.20%, DWT gave an accuracy of 90% and WPD produced a recognition accuracy of 87.50%. Thus wavelet based methods are found to be more suitable for recognizing speech because of their multi-resolution characteristics and efficient time frequency localizations. Moreover, wavelet methods have a better capability to model the unvoiced sound details.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.