Convolutive ICA-Based Forensic Speaker Identification Using Mel Frequency Cepstral Coefficients and Gaussian Mixture Models

Matheus Silveira,Rafael Sousa Júnior,Celso Oliveira,Cezar Schroeder,João Paulo Costa,Antonio Serrano,Paulo Quintiliano,José Apolinário Junior

doi:10.5769/j201301004

Abstract

Automatic speaker identification techniques are widely used nowadays in forensic applications, but its accuracy harshly drops when the voice of the speaker of interest is immersed in a recording containing more than one voice, common situation of investigations where the targets voice are obtained through ambient recordings. In forensic applications where microphones are hidden, such interferent sound sources in recordings are common and they degrade severely the performance of speaker identification techniques. In this paper, we propose a method to mitigate this problem by spatially separating the voice of each speaker using a Blind Source Separation technique called Convolutive Independent Component Analysis, and then applying the separated speech signals to a speaker identification system based on Mel Frequency Cepstral Coefficients and Gaussian Mixture Models. For identifying more than one speaker, the proposed system has a better accuracy than the state-of-the-art solutions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Convolutive ICA-Based Forensic Speaker Identification Using Mel Frequency Cepstral Coefficients and Gaussian Mixture Models

Abstract

Talk to us

Similar Papers

More From: The International Journal of Forensic Computer Science

Lead the way for us

Journal: The International Journal of Forensic Computer Science	Publication Date: Jul 2, 2013
Citations: 14

Similar Papers

Performance Analysis of Speaker Identification using Gaussian Mixture Model and Support Vector Machine
Aman Ranjan Verma ... S Premananda Singh
-
Aman Ranjan Verma, et. al.Aman Ranjan Verma ... S Premananda Singh
01 Nov 2019
01 Nov 2019

Wavelet based dynamic Mel Frequency Cepstral Coefficients (MFCC) and block truncation techniques for efficient speaker identification under narrowband noise conditions
...
International Journal of the Physical Sciences | VOL. 8
, et. al. ...
23 Sep 2013
International Journal of the Physical Sciences | VOL. 8

Skew Gaussian mixture models for speaker recognition
Avi Matza ... Yuval Bistritz
IET Signal Processing | VOL. 8
Avi Matza, et. al.Avi Matza ... Yuval Bistritz
01 Oct 2014
IET Signal Processing | VOL. 8

Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification
Joyanta Basu ... Tapan Kumar Basu
Circuits, Systems, and Signal Processing | VOL. 40
Joyanta Basu, et. al.Joyanta Basu ... Tapan Kumar Basu
20 Apr 2021
Circuits, Systems, and Signal Processing | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Convolutive ICA-Based Forensic Speaker Identification Using Mel Frequency Cepstral Coefficients and Gaussian Mixture Models

Abstract

Talk to us

Similar Papers

More From: The International Journal of Forensic Computer Science