A new kernel for SVM MLLR based speaker recognition

Zahi N Karam, William M Campbell

doi:10.21437/interspeech.2007-130

Abstract

Speaker recognition using support vector machines (SVMs) with features derived from generative models has been shown to perform well. Typically, a universal background model (UBM) is adapted to each utterance, yielding a set of features that are used in an SVM. We consider the case where the UBM is a Gaussian mixture model (GMM) and maximum likelihood linear regression (MLLR) adaptation is used to adapt the means of the UBM. We examine two possible SVM feature expansions that arise in this context: the first is a GMM supervector constructed by stacking the means of the adapted GMM, and the second consists of the elements of the MLLR transform. We examine several kernels associated with these expansions and show that both expansions are equivalent given an appropriate choice of kernels. Experiments performed on the NIST SRE 2006 corpus show that our choice of kernels, which is motivated by distance metrics between GMMs, outperforms ad hoc ones. We also apply SVM nuisance attribute projection (NAP) to the kernels as a form of channel compensation and show that, with a proper choice of kernel, we achieve results comparable to existing SVM-based recognizers.

Index Terms: speaker recognition, MLLR, SVM, supervector
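The equivalence between the two expansions can be illustrated with a small numerical sketch. It is not the authors' implementation, and the abstract does not give the kernel formulas. The sketch assumes a single global MLLR transform W = [A b] per utterance, diagonal UBM covariances, and a supervector kernel that weights each adapted mean by its mixture weight divided by its variance. The function names (adapt_means, supervector_kernel, transform_kernel) and the toy UBM are hypothetical. Under these assumptions, the weighted inner product between stacked adapted means can be computed directly from the rows of the two transforms, using matrices that depend only on the UBM.

```python
import math
import random


def adapt_means(W, means):
    """Apply one global MLLR transform W = [A b] (D x (D+1)) to every UBM mean."""
    adapted = []
    for mu in means:
        xi = mu + [1.0]  # extended mean vector [mu; 1]
        adapted.append([sum(w * x for w, x in zip(row, xi)) for row in W])
    return adapted


def supervector_kernel(W_a, W_b, means, weights, variances):
    """Weighted inner product between the stacked adapted means (supervectors)."""
    m_a = adapt_means(W_a, means)
    m_b = adapt_means(W_b, means)
    total = 0.0
    for lam, var, ma, mb in zip(weights, variances, m_a, m_b):
        for d in range(len(ma)):
            total += lam * ma[d] * mb[d] / var[d]
    return total


def transform_kernel(W_a, W_b, means, weights, variances):
    """The same value, computed from the rows of the MLLR transforms.

    For each feature dimension d, G_d = sum_i (lam_i / var_i[d]) * xi_i xi_i^T
    depends only on the UBM, and the kernel is sum_d w_a[d]^T G_d w_b[d].
    """
    D = len(W_a)
    total = 0.0
    for d in range(D):
        G = [[0.0] * (D + 1) for _ in range(D + 1)]
        for lam, var, mu in zip(weights, variances, means):
            xi = mu + [1.0]
            scale = lam / var[d]
            for r in range(D + 1):
                for c in range(D + 1):
                    G[r][c] += scale * xi[r] * xi[c]
        total += sum(
            W_a[d][r] * G[r][c] * W_b[d][c]
            for r in range(D + 1)
            for c in range(D + 1)
        )
    return total


# Toy UBM (assumed values): 4 Gaussians in 3 dimensions, diagonal covariances.
rng = random.Random(0)
N, D = 4, 3
means = [[rng.gauss(0.0, 1.0) for _ in range(D)] for _ in range(N)]
variances = [[rng.uniform(0.5, 2.0) for _ in range(D)] for _ in range(N)]
raw = [rng.random() + 0.1 for _ in range(N)]
weights = [r / sum(raw) for r in raw]

# Two random transforms standing in for two utterances.
W_a = [[rng.gauss(0.0, 1.0) for _ in range(D + 1)] for _ in range(D)]
W_b = [[rng.gauss(0.0, 1.0) for _ in range(D + 1)] for _ in range(D)]

k_sv = supervector_kernel(W_a, W_b, means, weights, variances)
k_tr = transform_kernel(W_a, W_b, means, weights, variances)
print(math.isclose(k_sv, k_tr, rel_tol=1e-9, abs_tol=1e-9))
```

In this sketch the two kernels agree because every adapted mean is a linear function of the transform, so any quadratic form on supervectors induces a quadratic form on the transform elements. Conversely, a plain inner product between transform elements generally corresponds to a different weighting in supervector space.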

