Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases

Musab T S Al-Kaltakchi,Wai L Woo,Satnam S Dlay,Mohammed A M Abdullah

doi:10.1007/s00034-021-01697-7

Abstract

In this article, a novel combined i-vector and an Extreme Learning Machine (ELM) is proposed for speaker identification. The ELM is chosen because it is fast to train and has a universal approximator property. Four combinations of features based on Mel Frequency Cepstral Coefficient and Power Normalized Cepstral Coefficient are used. Besides, seven fusion methods are exploited. The system is evaluated with three different databases, namely: the SITW 2006, NIST 2008, and the TIMIT database. This work employs the 2016 SITW database for the first time for speaker identification using the integration between the ELM and i-vector approach. From each database, 120 speakers with 1200 speech utterances are used (overall 360 speakers with 3600 speech utterances). Furthermore, comprehensive evaluations are exploited with a wide range of realistic background noise types (Stationary noise AWGN and Non-Stationary Noise types) with the handset effect. The proposed system is compared with the Gaussian Mixture Model-Universal Background Model (GMM-UBM) and other states of the art approaches. The results show that the i-vector method outperforms the GMM-UBM approach and other state- of-the-art methods under specific conditions, and that fusion techniques can be used to improve robustness to noise and handset effects.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases

Abstract

Talk to us

Similar Papers

More From: Circuits, Systems, and Signal Processing

Lead the way for us

Journal: Circuits, Systems, and Signal Processing	Publication Date: Mar 25, 2021
Citations: 14

Similar Papers

Comparison of I-vector and GMM-UBM approaches to speaker identification with TIMIT and NIST 2008 databases in challenging environments
Musab T S Al-Kaltakchi ... Jonathon A Chambers
-
Musab T S Al-Kaltakchi, et. al.Musab T S Al-Kaltakchi ... Jonathon A Chambers
01 Aug 2017
01 Aug 2017

Speaker verification from codec distorted speech for forensic investigation through serial combination of classifiers
M.S Athulya ... P.S Sathidevi
Digital Investigation | VOL. 25
M.S Athulya, et. al.M.S Athulya ... P.S Sathidevi
31 Mar 2018
Digital Investigation | VOL. 25

Robust Speaker Verification Using Improved PNCC Based on GMM-UBM
Xinxing Jing ... Haiyan Yang
International Journal of Automation and Power Engineering | VOL. 4
Xinxing Jing, et. al.Xinxing Jing ... Haiyan Yang
01 Jan 2015
International Journal of Automation and Power Engineering | VOL. 4

Evaluation of a speaker identification system with and without fusion using three databases in the presence of noise and handset effects
Musab T S Al-Kaltakchi ... Jonathon A Chambers
EURASIP Journal on Advances in Signal Processing | VOL. 2017
Musab T S Al-Kaltakchi, et. al.Musab T S Al-Kaltakchi ... Jonathon A Chambers
01 Dec 2017
EURASIP Journal on Advances in Signal Processing | VOL. 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combined i-Vector and Extreme Learning Machine Approach for Robust Speaker Identification and Evaluation with SITW 2016, NIST 2008, TIMIT Databases

Abstract

Talk to us

Similar Papers

More From: Circuits, Systems, and Signal Processing