Generalized Domain Adaptation Framework for Parametric Back-End in Speaker Recognition

Qiongqiong Wang, Kong Aik Lee, Takafumi Koshinaka, Koji Okabe

doi:10.1109/tifs.2023.3287733

Abstract

State-of-the-art speaker recognition systems comprise a speaker embedding front-end followed by a probabilistic linear discriminant analysis (PLDA) back-end. The effectiveness of these components relies on the availability of a large amount of labeled training data. In practice, it is common for the domain (e.g., language, channel, demographic) in which a system is deployed to differ from the one in which it was trained. To close the resulting gap, domain adaptation is often essential for PLDA models. Two variants of PLDA are heavy-tailed PLDA (HT-PLDA) and Gaussian PLDA (G-PLDA). Although the former fits real feature spaces better than the latter, its popularity has been severely limited by its computational complexity and, especially, by the difficulty it presents in domain adaptation, which results from its non-Gaussian property. Various domain adaptation methods have been proposed for G-PLDA. This paper proposes a generalized framework for domain adaptation that can be applied to both of the above variants of PLDA for speaker recognition. It not only includes several existing supervised and unsupervised domain adaptation methods but also allows more flexible use of available data in different domains. In particular, we introduce two new techniques: (1) correlation alignment at the model level, and (2) covariance regularization. To the best of our knowledge, this is the first proposed application of such techniques to domain adaptation for HT-PLDA. The efficacy of the proposed techniques has been experimentally validated on the NIST 2016, 2018, and 2019 Speaker Recognition Evaluation (SRE’16, SRE’18, and SRE’19) datasets.
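
For readers unfamiliar with correlation alignment, the following is a minimal sketch of the generic feature-level idea: whiten out-of-domain embeddings with their own covariance, then re-color them with the in-domain covariance. It is an illustration only and is not the model-level variant proposed in the paper, whose details the abstract does not give. The function names `coral_transform` and `_sym_power`, and the eigenvalue floor `eps`, are assumptions made for this example.

```python
import numpy as np


def _sym_power(mat, power, eps=1e-8):
    """Raise a symmetric positive semi-definite matrix to a real power."""
    vals, vecs = np.linalg.eigh(mat)
    vals = np.clip(vals, eps, None)  # floor eigenvalues for numerical stability
    return (vecs * vals**power) @ vecs.T


def coral_transform(source, target):
    """Align second-order statistics of `source` rows to those of `target`.

    Both inputs are (num_samples, dim) arrays of embeddings.
    Hypothetical helper for illustration; not the paper's algorithm.
    """
    src_centered = source - source.mean(axis=0)
    cov_s = np.cov(src_centered, rowvar=False)
    cov_t = np.cov(target, rowvar=False)
    whiten = _sym_power(cov_s, -0.5)   # removes source correlations
    recolor = _sym_power(cov_t, 0.5)   # imposes target correlations
    return src_centered @ whiten @ recolor + target.mean(axis=0)
```

Under these assumptions, the transformed out-of-domain embeddings have the same sample mean and covariance as the in-domain set, so a back-end trained on them sees statistics closer to the deployment domain.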

Journal: IEEE Transactions on Information Forensics and Security
Publication Date: Jan 1, 2023
Citations: 6

Similar Papers

A Generalized Framework for Domain Adaptation of PLDA in Speaker Recognition
Qiongqiong Wang ... Takafumi Koshinaka
01 May 2020

Covariance Regularization for Probabilistic Linear Discriminant Analysis
Zhiyuan Peng ... Tan Lee
04 Jun 2023

Comparison between supervised and unsupervised learning of probabilistic linear discriminant analysis mixture models for speaker verification
Timur Pekhovsky ... Aleksandr Sizov
Pattern Recognition Letters | VOL. 34
09 Apr 2013

PCLUDA: A Pseudo-Label Consistency Learning-Based Unsupervised Domain Adaptation Method for Cross-Domain Optical Remote Sensing Image Retrieval
Dongyang Hou ... Siyuan Wang
IEEE Transactions on Geoscience and Remote Sensing | VOL. 61
01 Jan 2023
