Abstract

In this paper we present a novel feature extraction algorithm based on Multitaper windows and Gamma tone filters for robust speaker verification systems in mismatched noisy conditions encountered in forensic area. The idea is to couple the advantage of the low-variance multitaper short term spectral estimators with the acoustic robustness of the auditory Gamma tone filter banks. Experimental results on the TIMIT corpus, with mismatched environment and low environmental signal to noise ratios (SNR) levels, show that the proposed Multitaper Gamma tone Cepstral Coefficient (MGCC) features outperform largely the conventional Mel Frequency Cepstral Coefficients (MFCC) features. Furthermore, and interestingly the proposed features outperforms at almost all the operating signal to noise ratios the recently proposed auditory hearing inspired Gamma tone Frequency Cepstral Coefficient (GFCC) feature for white, babble and factory noises using both the GMM-UBM de facto standard and the state-of-the art I-vector speaker verification systems.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call