DWT and LPC based feature extraction methods for isolated word recognition

Navnath S Nehe, Raghunath S Holambe

doi:10.1186/1687-4722-2012-7

Open Access

Abstract

In this article, new feature extraction methods that use wavelet decomposition and reduced-order linear predictive coding (LPC) coefficients are proposed for speech recognition. The coefficients are derived from speech frames decomposed using the discrete wavelet transform (DWT). LPC coefficients derived from the subband decomposition of a speech frame (abbreviated as WLPC) provide a better representation than modeling the frame directly. The WLPC coefficients are further normalized in the cepstrum domain to obtain a new set of features, denoted wavelet subband cepstral mean normalized features. The proposed approaches provide effective (better recognition rate), efficient (reduced feature vector dimension), and noise-robust features. The performance of these techniques has been evaluated on the TI-46 isolated word database and a self-created Marathi digits database in a white noise environment using the continuous density hidden Markov model. The experimental results also show the superiority of the proposed techniques over conventional methods such as linear predictive cepstral coefficients, Mel-frequency cepstral coefficients, spectral subtraction, and cepstral mean normalization in the presence of additive white Gaussian noise.
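The pipeline described in the abstract (wavelet subband decomposition of a frame, low-order LPC per subband, then normalization in the cepstrum domain) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the article: the Haar wavelet, the two decomposition levels, the per-band LPC orders `(2, 3, 4)`, the function names, and the choice to subtract the per-utterance mean from LPC-derived cepstra. It uses only the Python standard library.

```python
import math
import random

def haar_dwt(x):
    """One level of the Haar DWT: returns (approximation, detail)."""
    s = math.sqrt(2.0)
    idx = range(0, len(x) - 1, 2)
    return ([(x[i] + x[i + 1]) / s for i in idx],
            [(x[i] - x[i + 1]) / s for i in idx])

def subbands(frame, levels):
    """Dyadic decomposition: [D1, D2, ..., D_levels, A_levels]."""
    bands, approx = [], list(frame)
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        bands.append(detail)
    bands.append(approx)
    return bands

def lpc(x, order):
    """Autocorrelation-method LPC via the Levinson-Durbin recursion.
    Returns a[1..order] for the predictor x[n] ~ sum_k a[k] * x[n-k]."""
    r = [sum(x[n] * x[n - k] for n in range(k, len(x)))
         for k in range(order + 1)]
    a, err = [0.0] * (order + 1), r[0]
    for i in range(1, order + 1):
        if err <= 0.0:  # degenerate (e.g. all-zero) band: stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        prev = a[:]
        a[i] = k
        for j in range(1, i):
            a[j] = prev[j] - k * prev[i - j]
        err *= 1.0 - k * k
    return a[1:]

def lpc_to_cepstrum(a):
    """Cepstral coefficients c[1..p] of the all-pole model defined by a."""
    c = []
    for n in range(1, len(a) + 1):
        acc = a[n - 1]
        for k in range(1, n):
            acc += (k / n) * c[k - 1] * a[n - k - 1]
        c.append(acc)
    return c

def wlpc_features(frame, levels=2, orders=(2, 3, 4), cepstral=False):
    """Concatenate low-order LPC (or LPC-derived cepstra) of each subband.
    levels and orders are illustrative values, not the article's settings."""
    feats = []
    for band, order in zip(subbands(frame, levels), orders):
        a = lpc(band, order)
        feats.extend(lpc_to_cepstrum(a) if cepstral else a)
    return feats

def mean_normalize(vectors):
    """Subtract the per-dimension mean taken over all frames."""
    n, dim = len(vectors), len(vectors[0])
    mean = [sum(v[d] for v in vectors) / n for d in range(dim)]
    return [[v[d] - mean[d] for d in range(dim)] for v in vectors]

# Demo on a synthetic "utterance": 4 frames of 256 samples (not real speech).
rng = random.Random(0)
signal = [math.sin(0.3 * n) + 0.5 * math.sin(1.1 * n) + 0.05 * rng.gauss(0, 1)
          for n in range(4 * 256)]
frames = [signal[i:i + 256] for i in range(0, len(signal), 256)]
feats = mean_normalize([wlpc_features(f, cepstral=True) for f in frames])
print(len(feats), "frames x", len(feats[0]), "features")
```

With these illustrative orders, each frame yields 2 + 3 + 4 = 9 coefficients. This shows how modeling subbands with low-order predictors can keep the feature vector short, although the dimensions used in the article may differ.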

Highlights

A speech recognition system has two major components: feature extraction and classification.
The feature extraction method plays a vital role in the speech recognition task.
Experimental results: the performance of the proposed techniques is evaluated on isolated words in the presence of stationary white noise, using the TI-46 database and a self-created Marathi database.

Summary

Introduction

A speech recognition system has two major components: feature extraction and classification. The feature extraction method plays a vital role in the speech recognition task. There are two dominant approaches to acoustic measurement. The first is a temporal-domain, or parametric, approach such as linear prediction [1], which was developed to closely match the resonant structure of the human vocal tract that produces the corresponding sound. The linear prediction coefficients (LPC) technique is not well suited to representing speech, because it assumes the signal is stationary within a given frame and so cannot analyze localized events accurately. The second is a nonparametric frequency-domain approach based on the human auditory perception system, known as Mel-frequency cepstral coefficients (MFCC) [3]. The widespread use of the MFCCs is due

Journal: EURASIP Journal on Audio, Speech, and Music Processing
Publication Date: Jan 30, 2012
Citations: 62
License type: CC BY 2.0
