Comparison of feature extraction and normalization methods for speaker recognition using grid-audiovisual database

Musab T S Al-Kaltakchi,Mohanad Abd Shehab,Mohamed A.M Abdullah,Haithem Abd Al-Raheem Taha

doi:10.11591/ijeecs.v18.i2.pp782-789

Abstract

<p><span lang="EN-GB">In this paper, different feature extraction and feature normalization methods are investigated for speaker recognition. With a view to give a good representation of acoustic speech signals, Power Normalized Cepstral Coefficients (PNCCs) and Mel Frequency Cepstral Coefficients (MFCCs) are employed for feature extraction. Then, to mitigate the effect of linear channel, Cepstral Mean-Variance Normalization (CMVN) and feature warping are utilized. The current paper investigates Text-independent speaker identification system by using 16 coefficients from both the MFCCs and PNCCs features. Eight different speakers are selected from the GRID-Audiovisual database with two females and six males. The speakers are modeled using the coupling between the Universal Background Model and Gaussian Mixture Models (GMM-UBM) in order to get a fast scoring technique and better performance. The system shows 100% in terms of speaker identification accuracy. The results illustrated that PNCCs features have better performance compared to the MFCCs features to identify females compared to male speakers. Furthermore, feature wrapping reported better performance compared to the CMVN method. </span></p>

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Indonesian Journal of Electrical Engineering and Computer Science	Publication Date: May 1, 2020
Citations: 2	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Comparison of feature extraction and normalization methods for speaker recognition using grid-audiovisual database

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science

Lead the way for us

Similar Papers

Chapter 7 - Closed-set speaker identification system based on MFCC and PNCC features combination with different fusion strategies
Musab T.S Al-Kaltakchi ... Satnam S Dlay
Applied Speech Processing | VOL. -
Musab T.S Al-Kaltakchi, et. al.Musab T.S Al-Kaltakchi ... Satnam S Dlay
01 Jan 2020
Applied Speech Processing | VOL. -

Study of fusion strategies and exploiting the combination of MFCC and PNCC features for robust biometric speaker identification
M.T.S Al-Kaltakchi ... J A Chambers
-
M.T.S Al-Kaltakchi, et. al.M.T.S Al-Kaltakchi ... J A Chambers
01 Mar 2016
01 Mar 2016

Robust Speaker Verification Using Improved PNCC Based on GMM-UBM
Xinxing Jing ... Haiyan Yang
International Journal of Automation and Power Engineering | VOL. 4
Xinxing Jing, et. al.Xinxing Jing ... Haiyan Yang
01 Jan 2015
International Journal of Automation and Power Engineering | VOL. 4

Wavelet based dynamic Mel Frequency Cepstral Coefficients (MFCC) and block truncation techniques for efficient speaker identification under narrowband noise conditions
...
International Journal of the Physical Sciences | VOL. 8
, et. al. ...
23 Sep 2013
International Journal of the Physical Sciences | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of feature extraction and normalization methods for speaker recognition using grid-audiovisual database

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science