Generative pairwise models for speaker recognition

Sandro Cumani,Pietro Laface

doi:10.21437/odyssey.2014-41

Abstract

This paper proposes a simple model for speaker recognition based on i‐vector pairs, and analyzes its similarity and differences with respect to the state‐of‐the‐art Probabilistic Linear Discriminant Analysis (PLDA) and Pairwise Support Vector Machine (PSVM) models. Similar to the discriminative PSVM approach, we propose a generative model of i‐vector pairs, rather than an usual i‐vector based model. The model is based on two Gaussian distributions, one for the “same speakers” and the other for the “different speakers” i‐vector pairs, and on the assumption that the i‐vector pairs are independent. This independence assumption allows the distributions of the two classes to be independently estimated. The “Two‐Gaussian” approach can be extended to the Heavy‐Tailed distributions, still allowing a fast closed form solution to be obtained for testing i‐vector pairs. We show that this model is closely related to PLDA and to PSVM models, and that tested on the female part of the tel‐ tel NIST SRE 2010 extended evaluation set, it is able to achieve comparable accuracy with respect to the other models, trained with different objective functions and training procedures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generative pairwise models for speaker recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Tackling Age-Invariant Face Recognition With Non-Linear PLDA and Pairwise SVM
Pablo Negri ... Andrea Bottino
IEEE Access | VOL. 9
Pablo Negri, et. al.Pablo Negri ... Andrea Bottino
01 Jan 2020
IEEE Access | VOL. 9

Large-scale speaker search using PLDA on mismatched conditions
Jeff Ma ... Owen Kimball
-
Jeff Ma, et. al.Jeff Ma ... Owen Kimball
01 Apr 2015
01 Apr 2015

Investigating and improving the utility of probabilistic linear discriminant analysis for acoustic signal classification
Yuechi Jiang ... Frank H.F Leung
Digital Signal Processing | VOL. 114
Yuechi Jiang, et. al.Yuechi Jiang ... Frank H.F Leung
15 Apr 2021
Digital Signal Processing | VOL. 114

Joint Estimation of PLDA and Nonlinear Transformations of Speaker Vectors
Sandro Cumani ... Pietro Laface
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 25
Sandro Cumani, et. al.Sandro Cumani ... Pietro Laface
01 Oct 2017
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generative pairwise models for speaker recognition

Abstract

Talk to us

Similar Papers