Abstract

Generally Speech Recognition Systems are specific to speech/spoken word recognition or Speaker Identification/Verification. In this paper, An attempt has been made to find the better combination of Speech feature extraction and Artificial Neural Network Model for Speaker Identification combined with Spoken word recognition in general noisy back ground (i. e Home/Office environment). Different speech feature extraction techniques such as Mel Frequency cepstarl coefficients (MFCC), Perceptual Linear Prediction (PLP) Cepstral Coefficients and Gammatone Frequency Cepstral Coefficients (GFCC) in combination with two different Neural Network models such as Radial Basis Neural Networks and Learning Vector Quantization Neural Networks have been experimented. Three different test categories such as Spoken word recognition, Speaker Identification, and the combination of both speaker and spoken word recognition have been experimented for the above mentioned combinations. It is Suggested from the experiments that the combination of GFCC and Radial Basis Neural Networks gives the better recognition success rate in general noisy environment.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.