Comparison of I-vector and GMM-UBM approaches to speaker identification with TIMIT and NIST 2008 databases in challenging environments

Musab T S Al-Kaltakchi, Satnam S Dlay, Wai L Woo, Jonathon A Chambers

doi:10.23919/eusipco.2017.8081264

Open Access

https://doi.org/10.23919/eusipco.2017.8081264

Publication Date: Aug 1, 2017
Citations: 30
License type: cc-by

Affiliation: Newcastle University, Mustansiriyah University

Abstract

In this paper, two models, the I-vector and the Gaussian Mixture Model-Universal Background Model (GMM-UBM), are compared for the speaker identification task. Four feature combinations of I-vectors are considered with seven fusion techniques: maximum, mean, weighted sum, cumulative, interleaving, and concatenation for both two and four features. In addition, an Extreme Learning Machine (ELM) is used to identify speakers, and the Speaker Identification Accuracy (SIA) is then calculated. Both systems are evaluated on 120 speakers from the TIMIT and NIST 2008 databases for clean speech. Furthermore, for the TIMIT database, a comprehensive evaluation is made under Additive White Gaussian Noise (AWGN) conditions and with three types of Non-Stationary Noise (NSN), both with and without handset effects. The results show that the I-vector approach outperforms the GMM-UBM for both clean and AWGN conditions without a handset. However, the GMM-UBM achieves better accuracy for the NSN types.
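
As a rough illustration of some of the fusion rules named in the abstract, the following Python sketch combines two equal-length vectors and computes an identification accuracy as a percentage. It is not the authors' implementation: the abstract does not say whether fusion is applied to scores or to features, so the function names, the default weight of 0.5, and the treatment of inputs as plain lists of floats are all assumptions made here for illustration. Cumulative fusion is left out because the abstract does not define it.

```python
from typing import List, Sequence


def _check_same_length(a: Sequence[float], b: Sequence[float]) -> None:
    # Element-wise rules only make sense for vectors of equal length.
    if len(a) != len(b):
        raise ValueError("input vectors must have the same length")


def fuse_max(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Maximum fusion: keep the larger value at each position."""
    _check_same_length(a, b)
    return [max(x, y) for x, y in zip(a, b)]


def fuse_mean(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Mean fusion: average the two values at each position."""
    _check_same_length(a, b)
    return [(x + y) / 2.0 for x, y in zip(a, b)]


def fuse_weighted_sum(a: Sequence[float], b: Sequence[float],
                      w: float = 0.5) -> List[float]:
    """Weighted-sum fusion: w * a + (1 - w) * b (w is a hypothetical weight)."""
    _check_same_length(a, b)
    if not 0.0 <= w <= 1.0:
        raise ValueError("w must lie in [0, 1]")
    return [w * x + (1.0 - w) * y for x, y in zip(a, b)]


def fuse_concatenate(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Concatenation fusion: append b after a."""
    return list(a) + list(b)


def fuse_interleave(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """Interleaving fusion: alternate elements a0, b0, a1, b1, ..."""
    _check_same_length(a, b)
    out: List[float] = []
    for x, y in zip(a, b):
        out.extend((x, y))
    return out


def identification_accuracy(predicted: Sequence[str],
                            actual: Sequence[str]) -> float:
    """Percentage of test utterances assigned to the correct speaker."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("need two non-empty label lists of equal length")
    correct = sum(p == t for p, t in zip(predicted, actual))
    return 100.0 * correct / len(actual)
```

Under these assumptions, the element-wise rules (maximum, mean, weighted sum) keep the vector length unchanged, while concatenation and interleaving double it, so the choice of rule changes the input dimension seen by any classifier that follows.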


Similar Papers

Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases
Musab T S Al-Kaltakchi ... Mohammed A M Abdullah
Circuits, Systems, and Signal Processing | VOL. 40
25 Mar 2021

Evaluation of a speaker identification system with and without fusion using three databases in the presence of noise and handset effects
Musab T S Al-Kaltakchi ... Jonathon A Chambers
EURASIP Journal on Advances in Signal Processing | VOL. 2017
01 Dec 2017

Wavelet based dynamic Mel Frequency Cepstral Coefficients (MFCC) and block truncation techniques for efficient speaker identification under narrowband noise conditions
...
International Journal of the Physical Sciences | VOL. 8
23 Sep 2013

Thorough evaluation of TIMIT database speaker identification performance under noise with and without the G.712 type handset
Musab T S Al-Kaltakchi ... Mohammed A M Abdullah
International Journal of Speech Technology | VOL. 22
01 Sep 2019