Speaker identification enhancement under Co-channel conditions using sinusoidal model-based usable speech detection

S.S Khanwalkar,B.Y Smolenski,R.E Yantorno

doi:10.1109/ispacs.2004.1439044

Abstract

The accuracy of present day speaker identification systems (SID) is degraded in adverse acoustical environments. The idea of usable speech is to identify and extract those portions of degraded speech which are considered useful for SID. Recently, a usable speech extraction system was proposed to classify cochannel speech as usable speech and unusable speech for SID. Speech segments can be declared usable based upon a target-to-interferer energy ratio (TIR). By considering only usable speech for SID instead of the corrupted cochannel speech, it is seen that there is an increase in the accuracy. A novel usable speech detection measure using the sinusoidal model of speech and ESPRIT (estimation of signal parameters via rotational invariance technique), spectral estimation is proposed and investigated, which resulted in 82% correct detection of usable speech segments based on TIR. The usable speech frames extracted using ESPRIT when tested with the SID system, resulted in 84% accuracy in detecting speaker identity as compared to using entire co-channel speech which resulted in only 45% accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker identification enhancement under Co-channel conditions using sinusoidal model-based usable speech detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Usable speech measures and their fusion
R.E Yantorno ... B.Y Smolenski
-
R.E Yantorno, et. al.R.E Yantorno ... B.Y Smolenski
25 May 2003
25 May 2003

High accuracy ESPRIT without left-right ambiguity using an acoustic vector sensor array
Zhixiang Yao ... Rui Guo
-
Zhixiang Yao, et. al.Zhixiang Yao ... Rui Guo
01 Jan 2015
01 Jan 2015

Comparison of modelled pursuits with ESPRIT and the matrix pencil method in the modelling of medical percussion signals
Kenneth I Brown ... Jeremy J Wells
Biomedical Signal Processing and Control | VOL. 89
Kenneth I Brown, et. al.Kenneth I Brown ... Jeremy J Wells
03 Dec 2023
Biomedical Signal Processing and Control | VOL. 89

Linear Versus Nonlinear Multi-scale Decomposition for Co-channel Speaker Identification System
Wajdi Ghezaiel ... Ezzedine Ben Braiek
-
Wajdi Ghezaiel, et. al.Wajdi Ghezaiel ... Ezzedine Ben Braiek
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker identification enhancement under Co-channel conditions using sinusoidal model-based usable speech detection

Abstract

Talk to us

Similar Papers