A Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification

Claude Turner, Anthony Joseph

doi:10.1016/j.procs.2015.09.177

Open Access


Abstract

One of the most widely used approaches to feature extraction in speaker recognition is the filter bank-based Mel Frequency Cepstral Coefficients (MFCC) approach. The main goal of feature extraction in this context is to extract features from raw speech that capture the unique characteristics of a particular individual. During feature extraction, the discrete Fourier transform (DFT) is typically used to compute the spectrum of the speech waveform. Over the past few years, however, the discrete wavelet transform (DWT) has gained remarkable attention and has been favored over the DFT in a wide variety of applications. The wavelet packet transform (WPT) is an extension of the DWT that adds more flexibility to the decomposition process. This work studies the impact on performance, with respect to accuracy and efficiency, when the WPT is substituted for the DFT in the MFCC method. The novelty of our approach lies in its concentration on the wavelet and the decomposition level as the parameters influencing performance. We compare the performance of the DFT with that of the WPT, as well as with our previous work using the DWT. The results show that, with respect to the DFT, the WPT yields a significantly lower order for the Gaussian Mixture Model (GMM) used to model speech and a marginal improvement in accuracy. The WPT mirrors the DWT in terms of GMM order and can perform as well as the DWT under certain conditions.
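To make the substitution described in the abstract concrete, the following is a minimal sketch, not the authors' implementation, of cepstral-style features computed from a wavelet packet decomposition in place of a DFT spectrum. Several things here are assumptions made for illustration: the Haar wavelet, the decomposition level of 3, the use of plain subband log-energies instead of a mel-spaced filter bank, and all function names. The abstract does not specify any of these details.

```python
import math

def haar_split(x):
    """One Haar analysis step: low-pass and high-pass halves of x."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x) - 1, 2)]
    return low, high

def wavelet_packet_bands(frame, level):
    """Full wavelet packet tree: every node is split at each level
    (the DWT would split only the low-pass node).
    Returns 2**level subbands in increasing-frequency order."""
    bands = [list(frame)]
    for _ in range(level):
        nxt = []
        for i, band in enumerate(bands):
            low, high = haar_split(band)
            # Children of an odd-indexed (high-pass) node are swapped so the
            # bands stay in frequency order rather than filter order.
            nxt.extend([low, high] if i % 2 == 0 else [high, low])
        bands = nxt
    return bands

def band_energies(bands):
    """Energy (sum of squared coefficients) of each subband."""
    return [sum(c * c for c in band) for band in bands]

def dct2(values, n_coeffs):
    """Unnormalized DCT-II, keeping the first n_coeffs coefficients."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (j + 0.5) / n) for j, v in enumerate(values))
        for k in range(n_coeffs)
    ]

def wpt_cepstral_features(frame, level=3, n_coeffs=6, eps=1e-12):
    """Log subband energies followed by a DCT, mirroring the last two
    MFCC steps. Assumes len(frame) is a multiple of 2**level."""
    energies = band_energies(wavelet_packet_bands(frame, level))
    log_energies = [math.log(e + eps) for e in energies]
    return dct2(log_energies, n_coeffs)

# Hypothetical 64-sample frame containing a single sinusoid.
frame = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]
features = wpt_cepstral_features(frame)
```

The wavelet and the decomposition level, which the abstract names as the parameters under study, correspond here to the choice of `haar_split` and the `level` argument. A practical system would likely use a wavelet library and a longer wavelet filter; the hand-written Haar step only keeps the sketch dependency-free.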


Journal: Procedia Computer Science
Publication Date: Jan 1, 2015
Citations: 34

Similar Papers

The Wavelet and Fourier Transforms in Feature Extraction for Text-Dependent, Filterbank-Based Speaker Recognition
Claude Turner ... Heather Langdond
Procedia Computer Science | VOL. 6
01 Jan 2010

Discrete Wavelet Transform & Linear Prediction Coding Based Method for Speech Recognition via Neural Network
K. Daqrouq ... T. Abu
12 Sep 2011

Feature Extraction Techniques in Speech Processing: A Survey
Rekha Hibare ... Anup Vibhute
International Journal of Computer Applications | VOL. 107
18 Dec 2014

Two Problems of Wavelet Packet Transform
Makoto Kobayashi ... Kazushi Nakano
01 Apr 2013
