Abstract

The past decade has witnessed a significant improvement in the performance of speaker recognition (SR) technology with the introduction of the i-vector framework. Despite these advances, the performance of SR systems suffers considerably in the presence of acoustic nuisances and variabilities. In this paper, we develop a data-driven nuisance compensation technique in the i-vector space that does not refer to the effects of the targeted nuisances in the temporal domain. The approach is nonparametric in that it does not assume a specific relationship between a "good" version of an i-vector and its corrupted version. Instead, our algorithm directly models the joint distribution of both representations, the good i-vector and its corrupted counterpart, and takes advantage of the reproducibility of acoustic corruptions to generate the corrupted i-vectors. We then build an MMSE estimator that computes an improved version of a corrupted test i-vector given this joint distribution. Experiments are carried out on the NIST SRE 2010 and Speakers in the Wild databases, where the proposed algorithm is used to deal with additive noise and short utterances. Our technique proves efficient, improving the baseline system performance in terms of equal-error rate by up to 70% on known test noises and by up to 65% on unseen noises using a generic model. It also proves efficient in the context of duration mismatch, reaching up to 40% relative improvement on short utterances using multiple models corresponding to different durations, and up to 36% on test segments of arbitrary duration.
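For reference, the MMSE estimate of a good i-vector given its corrupted observation is the conditional mean under the modeled joint distribution. The following is a minimal sketch, assuming purely for illustration that the joint distribution of the good i-vector x and its corrupted version y is a single Gaussian (the actual joint model used in the paper may be richer), in which case the estimator has the familiar closed form:

% Illustrative sketch only: MMSE estimation under an assumed joint Gaussian model
% of the good i-vector x and its corrupted version y; the block parameters would be
% estimated from pairs of good and artificially corrupted i-vectors.
\[
  \hat{x}_{\mathrm{MMSE}} = \mathbb{E}\left[ x \mid y \right], \qquad
  \begin{pmatrix} x \\ y \end{pmatrix} \sim
  \mathcal{N}\!\left(
    \begin{pmatrix} \mu_x \\ \mu_y \end{pmatrix},
    \begin{pmatrix} \Sigma_{xx} & \Sigma_{xy} \\ \Sigma_{yx} & \Sigma_{yy} \end{pmatrix}
  \right)
  \;\Rightarrow\;
  \hat{x}_{\mathrm{MMSE}} = \mu_x + \Sigma_{xy}\,\Sigma_{yy}^{-1}\,(y - \mu_y).
\]

Under this illustrative assumption, compensation amounts to replacing each corrupted test i-vector y with its conditional mean given the learned joint statistics.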
