Improved filter bank on multitaper framework for robust Punjabi-ASR system

Virender Kadyan,R K Aggarwal,Archana Mantri

doi:10.1007/s10772-019-09654-1

Abstract

Robustness of the automatic speech recognition (ASR) system relies upon the accuracy of feature extraction and classification in training phase. The mismatch between training and testing conditions during classification of large feature vectors causes a low performance. In this paper, the issue of robustness of acoustic information is addressed for practical Punjabi dataset. Traditional feature extraction approaches: mel frequency cepstral coefficients (MFCC) and gammatone frequency cepstral coefficients (GFCC) face the issue of high variance with leakage of spectral information. Also, handling of the huge number of feature information creates chaos for large speech vocabulary. To overcome this dilemma, a Principal component analysis (PCA) based multi-windowing technique is proposed with the incorporation of baseline GFCC and MFCC based feature approaches after the tuning of taper parameter. The proposed integrated approaches result in better feature vectors, which are further processed using differential evolution + hidden Markov model (DE + HMM) based modelling classifier. The integrated approaches show substantial performance for word recognition as compared to the conventional or fused feature extraction systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improved filter bank on multitaper framework for robust Punjabi-ASR system

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Nov 7, 2019
Citations: 8

Similar Papers

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR
Mohit Dua ... Vinam Agrawal
Recent Advances in Computer Science and Communications | VOL. 14
Mohit Dua, et. al.Mohit Dua ... Vinam Agrawal
01 Dec 2021
Recent Advances in Computer Science and Communications | VOL. 14

GFCC based discriminatively trained noise robust continuous ASR system for Hindi language
Mohit Dua ... Mantosh Biswas
Journal of Ambient Intelligence and Humanized Computing | VOL. 10
Mohit Dua, et. al.Mohit Dua ... Mantosh Biswas
07 May 2018
Journal of Ambient Intelligence and Humanized Computing | VOL. 10

Performance evaluation of Hindi speech recognition system using optimized filterbanks
Mohit Dua ... Mantosh Biswas
Engineering Science and Technology, an International Journal | VOL. 21
Mohit Dua, et. al.Mohit Dua ... Mantosh Biswas
16 Apr 2018
Engineering Science and Technology, an International Journal | VOL. 21

Optimizing Integrated Features for Hindi Automatic Speech Recognition System
Mohit Dua ... Mantosh Biswas
Journal of Intelligent Systems | VOL. 29
Mohit Dua, et. al.Mohit Dua ... Mantosh Biswas
01 Oct 2018
Journal of Intelligent Systems | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved filter bank on multitaper framework for robust Punjabi-ASR system

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology