Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition

Shan Zhong,Yuxiang Shan,Jia Liu,Liang He

doi:10.1587/transfun.e92.a.1892

Abstract

One of the most important challenges in speaker recognition is intersession variability (ISV), primarily cross-channel effects. Recent NIST speaker recognition evaluations (SRE) include a multilingual scenario with training conversations involving multilingual speakers collected in a number of other languages, leading to further performance decline. One important reason for this is that more and more researchers are using phonetic clustering to introduce high level information to improve speaker recognition. But such language dependent methods do not work well in multilingual conditions. In this paper, we study both language and channel mismatch using a support vector machine (SVM) speaker recognition system. Maximum likelihood linear regression (MLLR) transforms adapting a universal background model (UBM) are adopted as features. We first introduce a novel language independent statistical binary-decision tree to reduce multi-language effects, and compare this data-driven approach with a traditional knowledge based one. We also construct a framework for channel compensation using feature-domain latent factor analysis (LFA) and MLLR supervector kernel-based nuisance attribute projection (NAP) in the model-domain. Results on the NIST SRE 2006 1conv4w-1conv4w/mic corpus show significant improvement. We also compare our compensated MLLR-SVM system with state-of-the-art cepstral Gaussian mixture and SVM systems, and combine them for a further improvement.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences

Lead the way for us

Journal: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences	Publication Date: Jan 1, 2009
License type: free

Similar Papers

A new kernel for SVM MLLR based speaker recognition
Zahi N Karam ... William M Campbell
-
Zahi N Karam, et. al.Zahi N Karam ... William M Campbell
27 Aug 2007
27 Aug 2007

Cluster adaptive training weights as features in SVM-based speaker verification
Hao Yang ... Yuan Dong
-
Hao Yang, et. al.Hao Yang ... Yuan Dong
27 Aug 2007
27 Aug 2007

Sub-vector based biometric speaker verification using MLLR super-vector
A K Sarkar ... J F Bonastre
International Journal of Speech Technology | VOL. 19
A K Sarkar, et. al.A K Sarkar ... J F Bonastre
27 Nov 2015
International Journal of Speech Technology | VOL. 19

MLLR transforms as features in speaker recognition
Andreas Stolcke ... Luciana Ferrer
-
Andreas Stolcke, et. al.Andreas Stolcke ... Luciana Ferrer
04 Sep 2005
04 Sep 2005

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Research on Intersession Variability Compensation for MLLR-SVM Speaker Recognition

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences