Robust Spectral Features for Automatic Speaker Recognition in Mismatch Condition

Sharada V Chougule,Mahesh S Chavan

doi:10.1016/j.procs.2015.08.021

Abstract

The widespread use of automatic speaker recognition technology in real-world applications demands robustness against a variety of realistic conditions. In this paper, a robust spectral feature set called NDSF (Normalized Dynamic Spectral Features) is proposed for automatic speaker recognition under mismatch conditions. Magnitude spectral subtraction is performed on the spectral features to compensate for additive noise. A further spectral-domain modification is applied using a time-difference approach followed by a Gaussianization non-linearity. Histogram normalization is then applied to these dynamic spectral features to compensate for channel mismatch and for some of the non-linear effects introduced by handset transducers. Feature extraction with the proposed features is carried out for a text-independent automatic speaker recognition (identification) system. The performance of the proposed feature set is compared with that of conventional cepstral features, such as mel-frequency cepstral coefficients and linear prediction cepstral coefficients, under acoustic mismatch caused by the use of different sensors. Studies are performed on two databases: the multi-variability speaker recognition (MVSR) database developed by IIT-Guwahati and the Multi-speaker continuous (Hindi) speech database (Department of Information Technology, Government of India). The experimental analysis shows that spectral-domain dynamic features enhance robustness by reducing additive noise and the channel effects caused by sensor mismatch. The proposed NDSF features are found to be more robust than cepstral features on both datasets.
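The processing chain named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no parameters, so the noise estimate (mean of the first frames), the spectral floor, the one-frame lag, and the rank-based mapping to a standard normal target are all assumptions made for illustration. The Gaussianization non-linearity is not specified in the abstract and is left out; the function names (`spectral_subtract`, `time_difference`, `histogram_normalize`, `ndsf_like`) are hypothetical. Input is a list of magnitude-spectrum frames, each a list of per-bin magnitudes.

```python
from statistics import NormalDist


def spectral_subtract(frames, noise_frames=2, floor=0.01):
    """Magnitude spectral subtraction with a simple spectral floor.

    Assumption: the first `noise_frames` frames contain noise only,
    and their per-bin mean is used as the noise estimate.
    """
    n_bins = len(frames[0])
    noise = [sum(f[k] for f in frames[:noise_frames]) / noise_frames
             for k in range(n_bins)]
    # Floor each bin at a small fraction of its original magnitude
    # so that the subtraction never yields a negative value.
    return [[max(f[k] - noise[k], floor * f[k]) for k in range(n_bins)]
            for f in frames]


def time_difference(frames, lag=1):
    """Per-bin difference between each frame and the frame `lag` steps earlier."""
    return [[cur[k] - prev[k] for k in range(len(cur))]
            for prev, cur in zip(frames, frames[lag:])]


def histogram_normalize(values):
    """Map a sequence onto standard-normal quantiles according to rank.

    Assumption: a standard normal target distribution; the paper's
    actual reference histogram is not given in the abstract.
    """
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    target = NormalDist()  # mean 0, standard deviation 1
    out = [0.0] * n
    for rank, i in enumerate(order):
        # (rank + 0.5) / n stays strictly inside (0, 1).
        out[i] = target.inv_cdf((rank + 0.5) / n)
    return out


def ndsf_like(frames):
    """Chain the three steps; returns (len(frames) - 1) frames of features."""
    deltas = time_difference(spectral_subtract(frames))
    n_bins = len(deltas[0])
    # Normalize each frequency bin's trajectory over time independently.
    columns = [histogram_normalize([d[k] for d in deltas])
               for k in range(n_bins)]
    # Transpose back to a frames-by-bins layout.
    return [[columns[k][t] for k in range(n_bins)]
            for t in range(len(deltas))]


if __name__ == "__main__":
    toy = [[1.0, 2.0], [1.0, 2.0], [3.0, 5.0], [2.0, 9.0], [6.0, 4.0]]
    for row in ndsf_like(toy):
        print([round(v, 3) for v in row])
```

Normalizing each bin over time by rank makes the output insensitive to any monotonic distortion of that bin's values, which is the general motivation for using histogram normalization against channel and transducer effects.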

Journal: Procedia Computer Science
Publication Date: Jan 1, 2015
Citations: 14
License type: cc-by-nc-nd

Similar Papers

Speaker recognition utilizing distributed DCT-II based Mel frequency cepstral coefficients and fuzzy vector quantization
M Afzal Hossan ... Mark A Gregory
International Journal of Speech Technology | VOL. 16 | 28 Jun 2012

Identification of Speakers from Their Hum
Hemant A Patil ... Robin Jain
08 Sep 2008

A Novel Approach to Identification of Speakers from Their Hum
Hemant A Patil ... Robin Jain
01 Feb 2009

A Comparison of MFCC and LPCC with Deep Learning for Speaker Recognition
Haiyan Yang ... Yanrong Deng
01 Jan 2019
