Application of the mutual information minimization to speaker recognition/identification improvement

Jordi Solé-Casals, Marcos Faundez-Zanuy

doi:10.1016/j.neucom.2005.12.023

Abstract

In this paper we propose inverting nonlinear distortions in order to improve the recognition rates of a speaker recognition system. We study the effect of saturation on the test signals, to reflect real situations in which the training material has been recorded under controlled conditions but the test signals are mismatched in input signal level (saturation). The experimental results for speaker recognition show that a combination of several strategies can improve the recognition rate with saturated test sentences from 80% to 89.39%, while the result with clean speech (without saturation) is 87.76% for one microphone. For speaker identification, the combination can reduce the minimum detection cost function with saturated test sentences from 6.42% to 4.15%, while the results with clean speech (without saturation) are 5.74% for one microphone and 7.02% for the other.
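To make the notion of a saturated test signal concrete, the following minimal sketch simulates saturation as hard clipping of a sampled waveform. Hard clipping, the function name `saturate`, the sine test signal, and the 0.6 clipping level are all illustrative assumptions; the abstract does not state which saturation function or levels the authors used.

```python
import math


def saturate(samples, limit):
    """Hard-clip each sample to the range [-limit, limit].

    Hard clipping is an assumed stand-in for the saturation
    described in the abstract, not the authors' distortion model.
    """
    return [max(-limit, min(limit, s)) for s in samples]


# Hypothetical test signal: one period of a 100 Hz sine sampled at 8 kHz,
# with peak amplitude 1.0.
sample_rate = 8000
clean = [
    math.sin(2 * math.pi * 100 * n / sample_rate)
    for n in range(sample_rate // 100)
]

# Simulate an input level that is too high for the recording chain:
# everything beyond 60% of full scale is flattened.
clipped = saturate(clean, limit=0.6)

# Fraction of samples altered by the saturation.
clipped_fraction = sum(1 for c, s in zip(clean, clipped) if c != s) / len(clean)
print(f"{clipped_fraction:.0%} of samples were clipped")
```

A mismatch of this kind affects only the test material, since the training signals are assumed to be recorded without saturation.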

Journal: Neurocomputing
Publication Date: Jan 31, 2006
Citations: 22
License type: cc0

Similar Papers

Application of the Mutual Information Minimization to Speaker Recognition / Verification Improvement
Jordi Solé-Casals ... Marcos Faúndez-Zanuy
01 Jan 2004

Detecting Replay Attacks Using Single-Channel Audio: The Temporal Autocorrelation of Speech
Shih-Kuang Lee ... Yu Tsao
07 Nov 2022

Replay Speech Detection Based on Dual-Input Hierarchical Fusion Network
Chenlei Hu ... Qingsheng Yuan
Applied Sciences | VOL. 13
25 Apr 2023

Stereo-based histogram equalization for robust speech recognition
Randa Al-Wakeel ... Magdy Aboul-Ela
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2015
09 Jun 2015
