Noise robust speaker verification via the fusion of SNR-independent and SNR-dependent PLDA

Xiaomin Pang, Man-Wai Mak

doi:10.1007/s10772-015-9310-8

Abstract

While i-vectors with probabilistic linear discriminant analysis (PLDA) can achieve state-of-the-art performance in speaker verification, the mismatch caused by acoustic noise remains a key factor affecting system performance. In this paper, a fusion system that combines a multi-condition signal-to-noise ratio (SNR)-independent PLDA model and a mixture of SNR-dependent PLDA models is proposed to make speaker verification systems more noise robust. First, the whole range of SNR in which a verification system is expected to operate is divided into several narrow ranges. Then, a set of SNR-dependent PLDA models, one for each narrow SNR range, is trained. During verification, the SNR of the test utterance is used to determine which of the SNR-dependent PLDA models is used for scoring. To further enhance performance, the SNR-dependent and SNR-independent models are fused using linear and logistic regression fusion. The performance of the fusion system and the SNR-dependent system is evaluated on the NIST 2012 speaker recognition evaluation for both noisy and clean conditions. Results show that a mixture of SNR-dependent PLDA models performs better in both clean and noisy conditions. It was also found that the fusion system is more robust than the conventional i-vector/PLDA systems under noisy conditions.
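The model-selection and fusion steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The function names, the SNR boundaries, and the fusion weights are all hypothetical, and the scorers are passed in as plain callables standing in for trained PLDA models. In practice the weights and bias would be fitted on development data, for example by logistic regression. Here they are fixed values chosen only for illustration.

```python
from bisect import bisect_right
from typing import Callable, Sequence

# A scorer takes (target i-vector, test i-vector) and returns a score.
# In a real system this would be a trained PLDA model; here it is any callable.
Scorer = Callable[[Sequence[float], Sequence[float]], float]


def select_model_index(snr_db: float, boundaries: Sequence[float]) -> int:
    """Return the index of the SNR range that contains snr_db.

    `boundaries` must be sorted in ascending order. K boundaries split the
    SNR axis into K + 1 ranges, so the result lies in 0..K. A value equal
    to a boundary is assigned to the higher range.
    """
    return bisect_right(boundaries, snr_db)


def fused_score(
    target: Sequence[float],
    test: Sequence[float],
    test_snr_db: float,
    boundaries: Sequence[float],
    dependent_scorers: Sequence[Scorer],
    independent_scorer: Scorer,
    w_dep: float = 0.5,    # hypothetical weight, would be learned in practice
    w_indep: float = 0.5,  # hypothetical weight, would be learned in practice
    bias: float = 0.0,     # hypothetical bias term
) -> float:
    """Linearly fuse an SNR-dependent score with an SNR-independent score."""
    if len(dependent_scorers) != len(boundaries) + 1:
        raise ValueError("need exactly one SNR-dependent scorer per SNR range")
    # The SNR of the test utterance decides which SNR-dependent model scores it.
    k = select_model_index(test_snr_db, boundaries)
    s_dep = dependent_scorers[k](target, test)
    s_indep = independent_scorer(target, test)
    return w_dep * s_dep + w_indep * s_indep + bias
```

With two hypothetical boundaries such as `[6.0, 15.0]`, an utterance with an estimated SNR of 10 dB would be scored by the second SNR-dependent model, and that score would then be combined with the SNR-independent score.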

Journal: International Journal of Speech Technology
Publication Date: Oct 12, 2015
Citations: 15

Similar Papers

Fusion of SNR-dependent PLDA models for noise robust speaker verification
Xiaomin Pang ... Man-Wai Mak
01 Sep 2014

Large-scale speaker search using PLDA on mismatched conditions
Jeff Ma ... Jan Silovsky
01 Apr 2015

Comparison between supervised and unsupervised learning of probabilistic linear discriminant analysis mixture models for speaker verification
Timur Pekhovsky ... Aleksandr Sizov
Pattern Recognition Letters | VOL. 34
09 Apr 2013

Fast Scoring of Full Posterior PLDA Models
Sandro Cumani
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 23
01 Nov 2015
