Speech frame recognition based on less shift sensitive wavelet filter banks

Hamid Reza Tohidypour,Amin Banitalebi-Dehkordi

doi:10.1007/s11760-015-0787-z

Abstract

The wavelet transform possesses multi-resolution property and high localization performance; hence, it can be optimized for speech recognition. In our previous work, we show that redundant wavelet filter bank parameters work better in speech recognition task, because they are much less shift sensitive than those of critically sampled discrete wavelet transform (DWT). In this paper, three types of wavelet representations are introduced, including features based on dual-tree complex wavelet transform (DT-CWT), perceptual dual-tree complex wavelet transform, and four-channel double-density discrete wavelet transform (FCDDDWT). Then, appropriate filter values for DT-CWT and FCDDDWT are proposed. The performances of the proposed wavelet representations are compared in a phoneme recognition task using special form of the time-delay neural networks. Performance evaluations confirm that dual-tree complex wavelet filter banks outperform conventional DWT in speech recognition systems. The proposed perceptual dual-tree complex wavelet filter bank results in up to approximately 9.82 % recognition rate increase, compared to the critically sampled two-channel wavelet filter bank.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech frame recognition based on less shift sensitive wavelet filter banks

Abstract

Talk to us

Similar Papers

More From: Signal, Image and Video Processing

Lead the way for us

Journal: Signal, Image and Video Processing	Publication Date: Jun 10, 2015
Citations: 3

Similar Papers

A new representation for speech frame recognition based on redundant wavelet filter banks
Hamid Reza Tohidypour ... Hossein Roshandel
Speech Communication | VOL. 54
Hamid Reza Tohidypour, et. al.Hamid Reza Tohidypour ... Hossein Roshandel
08 Sep 2011
Speech Communication | VOL. 54

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
Ronan Flynn, et. al.Ronan Flynn ... Edward Jones
20 May 2008
Speech Communication | VOL. 50

Comparison of DWT, DWT-SWT, and DT-CWT for low resolution satellite images enhancement
M Hemalatha ... S Varadarajan
-
M Hemalatha, et. al.M Hemalatha ... S Varadarajan
01 Feb 2017
01 Feb 2017

Automatic ECG arrhythmia classification using dual tree complex wavelet based features
Manu Thomas ... Samit Ari
AEU - International Journal of Electronics and Communications | VOL. 69
Manu Thomas, et. al.Manu Thomas ... Samit Ari
06 Jan 2015
AEU - International Journal of Electronics and Communications | VOL. 69

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech frame recognition based on less shift sensitive wavelet filter banks

Abstract

Talk to us

Similar Papers

More From: Signal, Image and Video Processing