Modulation features for noise robust speaker identification

Vikramjit Mitra, Mitchell McLaren, Martin Graciarena, Horacio Franco, Nicolas Scheffer

doi:10.21437/interspeech.2013-695

Abstract

Current state-of-the-art speaker identification (SID) systems perform exceptionally well under clean conditions, but their performance deteriorates when noise and channel degradations are introduced. The literature has mostly focused on robust modeling techniques to combat degradations due to background noise and/or channel effects, and has demonstrated significant improvement in SID performance in noise. In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance. We propose Modulation features of Medium Duration sub-band Speech Amplitudes (MMeDuSA), an acoustic feature motivated by human auditory processing that is robust to noise corruption and captures speaker stylistic differences. We analyze the performance of MMeDuSA with SRI International’s robust SID system on a channel- and noise-degraded multilingual corpus distributed through the Defense Advanced Research Projects Agency (DARPA) Robust Automatic Transcription of Speech (RATS) program. When benchmarked against standard cepstral features (MFCC) and other noise-robust acoustic features, MMeDuSA yielded lower SID error rates.

