On the use of i–vector posterior distributions in Probabilistic Linear Discriminant Analysis

Sandro Cumani,Pietro Laface,Oldrich Plchot

doi:10.1109/taslp.2014.2308473

Sandro Cumani, Pietro Laface + Show 1 more

Open Access

https://doi.org/10.1109/taslp.2014.2308473

Copy DOI

Abstract

The i-vector extraction process is affected by several factors such as the noise level, the acoustic content of the observed features, the channel mismatch between the training conditions and the test data, and the duration of the analyzed speech segment. These factors influence both the i-vector estimate and its uncertainty, represented by the i-vector posterior covariance. This paper presents a new PLDA model that, unlike the standard one, exploits the intrinsic i-vector uncertainty. Since the recognition accuracy is known to decrease for short speech segments, and their length is one of the main factors affecting the i-vector covariance, we designed a set of experiments aiming at comparing the standard and the new PLDA models on short speech cuts of variable duration, randomly extracted from the conversations included in the NIST SRE 2010 extended dataset, both from interviews and telephone conversations. Our results on NIST SRE 2010 evaluation data show that in different conditions the new model outperforms the standard PLDA by more than 10% relative when tested on short segments with duration mismatches, and is able to keep the accuracy of the standard model for long enough speaker segments. This technique has also been successfully tested in the NIST SRE 2012 evaluation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On the use of i–vector posterior distributions in Probabilistic Linear Discriminant Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Apr 1, 2014
Citations: 58

Similar Papers

Probabilistic linear discriminant analysis of i-vector posterior distributions
Sandro Cumani ... Pietro Laface
-
Sandro Cumani, et. al.Sandro Cumani ... Pietro Laface
01 May 2013
01 May 2013

Exemplar based language recognition method for short-duration speech segments
Meng-Ge Wang ... Bing Jiang
-
Meng-Ge Wang, et. al.Meng-Ge Wang ... Bing Jiang
01 May 2013
01 May 2013

Fast Scoring of Full Posterior PLDA Models
Sandro Cumani
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 23
Sandro CumaniSandro Cumani
01 Nov 2015
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 23

Support vector machines for speaker based speech indexing
M H Moattar ... M M Homayounpour
-
M H Moattar, et. al.M H Moattar ... M M Homayounpour
01 Oct 2009
01 Oct 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the use of i–vector posterior distributions in Probabilistic Linear Discriminant Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing