Performance evaluation of Hindi speech recognition system using optimized filterbanks

Mohit Dua,Rajesh Kumar Aggarwal,Mantosh Biswas

doi:10.1016/j.jestch.2018.04.005

Abstract

An Automatic Speech Recognition (ASR) system implementation uses a conventional pattern recognition technique that stores a set of training patterns in classes and compares the test patterns with training patterns to place them in the best matched pattern class. Most state-of-the-art ASR systems use Mel Frequency Cepstral Coefficient (MFCC) and Perceptual Linear Prediction (PLP) to extract features in training phase of the ASR system. However, sensitivity of MFCC & PLP to background noise has resulted in use of noise robust features Gammatone Frequency Cepstral Coefficient (GFCC) and Basilar-membrane Frequency-band Cepstral Coefficient (BFCC). But many issues associated with these feature extraction methods, like accepted bandwidth and standard number of filters are unresolved till date. This paper proposes a novel approach to use Differential Evolution (DE) algorithm to optimize the number and spacing of filters used in MFCC, GFCC and BFCC techniques. It also evaluates the performance of the said feature extraction methods with and without DE optimization in clean as well as in noisy environments. The results conclude that BFCC based ASR systems performs 0.4% to 1.0% better than GFCC and 7% to 10% better than MFCC in different conditions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Engineering Science and Technology, an International Journal	Publication Date: Apr 16, 2018
Citations: 19	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Performance evaluation of Hindi speech recognition system using optimized filterbanks

Abstract

Talk to us

Similar Papers

More From: Engineering Science and Technology, an International Journal

Lead the way for us

Similar Papers

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR
Mohit Dua ... Vinam Agrawal
Recent Advances in Computer Science and Communications | VOL. 14
Mohit Dua, et. al.Mohit Dua ... Vinam Agrawal
01 Dec 2021
Recent Advances in Computer Science and Communications | VOL. 14

GFCC based discriminatively trained noise robust continuous ASR system for Hindi language
Mohit Dua ... Mantosh Biswas
Journal of Ambient Intelligence and Humanized Computing | VOL. 10
Mohit Dua, et. al.Mohit Dua ... Mantosh Biswas
07 May 2018
Journal of Ambient Intelligence and Humanized Computing | VOL. 10

Optimizing Integrated Features for Hindi Automatic Speech Recognition System
Mohit Dua ... Mantosh Biswas
Journal of Intelligent Systems | VOL. 29
Mohit Dua, et. al.Mohit Dua ... Mantosh Biswas
01 Oct 2018
Journal of Intelligent Systems | VOL. 29

Discriminative Training Using Noise Robust Integrated Features and Refined HMM Modeling
Mohit Dua ... Mantosh Biswas
Journal of Intelligent Systems | VOL. 29
Mohit Dua, et. al.Mohit Dua ... Mantosh Biswas
20 Feb 2018
Journal of Intelligent Systems | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance evaluation of Hindi speech recognition system using optimized filterbanks

Abstract

Talk to us

Similar Papers

More From: Engineering Science and Technology, an International Journal