Fast i-vector denoising using MAP estimation and a noise distributions database for robust speaker recognition

Waad Ben Kheder, Driss Matrouf, Pierre-Michel Bousquet, Jean-François Bonastre, Moez Ajili

doi:10.1016/j.csl.2016.12.007

Open Access

https://doi.org/10.1016/j.csl.2016.12.007

Journal: Computer Speech & Language
Publication Date: Feb 14, 2017
Citations: 17
License type: other-oa

Affiliation: Laboratoire Informatique d'Avignon

Abstract

Since the i-vector paradigm was introduced in the field of speaker recognition, many techniques have been proposed to deal with additive noise within this framework. Because the effect of noise in the i-vector space is complex, much of this effort has addressed noise in other domains (speech enhancement, feature compensation, robust i-vector extraction and robust scoring). As far as we know, there has been no serious attempt to handle the noise problem directly in the i-vector space without relying on data distributions computed in a prior domain. The aim of this paper is twofold. First, it proposes a full-covariance Gaussian model of the clean i-vector and noise distributions in the i-vector space and introduces a technique, i-MAP, that estimates a clean i-vector from its noisy version and the noise density function using the MAP approach. Based on NIST data, we show that this can improve the baseline system performance by up to 60%. Second, to make this algorithm usable in a real application and reduce the computational time needed by i-MAP, we propose an extension that builds a noise distribution database in the i-vector space in an off-line step and uses it later in the test phase. We show that this approach achieves comparable results (up to 57% relative EER improvement) with a sufficiently large noise distribution database.
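The MAP step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes an additive model in the i-vector space, Y = X + N, where the clean i-vector X and the noise N are independent with full-covariance Gaussian densities; under that assumption the MAP estimate of X has the closed form mu_x + Sigma_x (Sigma_x + Sigma_n)^-1 (y - mu_x - mu_n). The function names (solve, map_denoise) and all numeric values are hypothetical, and pure Python is used only to keep the example self-contained; a linear algebra library would normally do the solve.

```python
def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix [a | b]; copies the rows so the inputs are untouched.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def map_denoise(y, mu_x, sigma_x, mu_n, sigma_n):
    """MAP estimate of a clean vector x from y = x + n (assumed model).

    x ~ N(mu_x, sigma_x) and n ~ N(mu_n, sigma_n) are assumed independent,
    with full covariance matrices given as lists of rows.
    """
    d = len(y)
    # Covariance of the noisy vector: sigma_x + sigma_n.
    total = [[sigma_x[i][j] + sigma_n[i][j] for j in range(d)] for i in range(d)]
    # Deviation of y from its expected value mu_x + mu_n.
    resid = [y[i] - mu_x[i] - mu_n[i] for i in range(d)]
    z = solve(total, resid)
    # x_hat = mu_x + sigma_x (sigma_x + sigma_n)^-1 (y - mu_x - mu_n)
    return [mu_x[i] + sum(sigma_x[i][j] * z[j] for j in range(d)) for i in range(d)]


# Toy 2-dimensional example with made-up parameters.
x_hat = map_denoise(
    y=[3.5, 1.5],
    mu_x=[1.0, -1.0],
    sigma_x=[[2.0, 1.0], [1.0, 2.0]],
    mu_n=[0.5, 0.5],
    sigma_n=[[2.0, 1.0], [1.0, 2.0]],
)
print(x_hat)
```

Because the clean and noise covariances are equal in this toy case, the estimate lies halfway between the prior mean mu_x and the mean-compensated observation y - mu_n. Solving one linear system against Sigma_x + Sigma_n avoids inverting each covariance separately.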

Similar Papers

Robust Speaker Recognition Using MAP Estimation of Additive Noise in i-vectors Space
Waad Ben Kheder ... Pierre-Michel Bousquet
01 Jan 2014

A Unified Joint Model to Deal With Nuisance Variabilities in the i-Vector Space
Waad Ben Kheder ... Moez Ajili
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26
01 Mar 2018

An Improved VTS Feature Compensation using Mixture Models of Distortion and IVN Training for Noisy Speech Recognition
Jun Du ... Qiang Huo
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
01 Nov 2014

Average Fisher information optimization for quantized measurements using additive independent noise
Gokce Osman Balkan ... Sinan Gezici
01 Apr 2010
