Abstract

Most speaker recognition systems rely on short-term acoustic cepstral features to extract speaker-relevant information from the signal. Phonetic discriminant features, extracted by a bottleneck multi-layer perceptron (MLP) over longer temporal spans, can provide complementary information and have been adopted in speech transcription systems. We compare speaker verification performance using cepstral features, discriminant features, and a concatenation of both followed by a dimension reduction. We consider two speaker recognition systems, one based on maximum likelihood linear regression (MLLR) super-vectors and the other on a state-of-the-art i-vector system with two session variability compensation schemes. Experiments are reported on a standard configuration of the NIST SRE 2008 and 2010 databases. The results show that the phonetically discriminative MLP features retain speaker-specific information that is complementary to the short-term cepstral features. Performance improvements are obtained with both score-domain and feature-domain fusion, and the speaker verification equal error rate (EER) is reduced by up to 50% relative to the best i-vector system using only cepstral features.
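The abstract mentions two ways of combining the cepstral and MLP bottleneck streams: feature-domain combination (concatenation followed by a dimension reduction) and score-domain fusion. The sketch below illustrates both ideas in a minimal, hedged form; it is not the authors' exact pipeline. The feature dimensions, the PCA target dimension, and the fusion weight are illustrative assumptions, and random arrays stand in for real per-frame features and per-trial scores.

```python
# Minimal sketch of feature concatenation + dimension reduction and linear score fusion.
# All dimensions and weights below are assumptions for illustration, not values from the paper.
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)

# Stand-ins for real per-frame features (rows = frames).
n_frames = 1000
cepstral = rng.standard_normal((n_frames, 39))    # e.g. MFCCs with deltas (assumed 39-dim)
bottleneck = rng.standard_normal((n_frames, 39))  # e.g. MLP bottleneck outputs (assumed 39-dim)

# Feature-domain combination: concatenate both streams frame by frame, then reduce the
# dimension so the downstream system (MLLR super-vector or i-vector) sees a compact input.
concatenated = np.hstack([cepstral, bottleneck])  # shape (n_frames, 78)
reducer = PCA(n_components=39)                    # target dimension is an assumption
combined_features = reducer.fit_transform(concatenated)
print(combined_features.shape)                    # (1000, 39)

# Score-domain fusion: a weighted sum of per-trial verification scores produced by the
# cepstral-only and bottleneck-only systems (fusion weight alpha chosen for illustration).
scores_cepstral = rng.standard_normal(200)
scores_bottleneck = rng.standard_normal(200)
alpha = 0.5
fused_scores = alpha * scores_cepstral + (1.0 - alpha) * scores_bottleneck
```

In practice the reduction could equally be a discriminative projection rather than PCA, and fusion weights would be calibrated on a development set; the code only shows the structure of the two combination strategies described above.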
