Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification

Nengheng Zheng ,Ning Wang ,Tan Lee ,P.c Ching

doi:10.30019/ijclclp.200709.0004

Abstract

This paper describes a speaker identification system that uses complementary acoustic features derived from the vocal source excitation and the vocal tract system. Conventional speaker recognition systems typically adopt the cepstral coefficients, e.g., Mel-frequency cepstral coefficients (MFCC) and linear predictive cepstral coefficients (LPCC), as the representative features. The cepstral features aim at characterizing the formant structure of the vocal tract system. This study proposes a new feature set, named the wavelet octave coefficients of residues (WOCOR), to characterize the vocal source excitation signal. WOCOR is derived by wavelet transformation of the linear predictive (LP) residual signal and is capable of capturing the spectro-temporal properties of vocal source excitation. WOCOR and MFCC contain complementary information for speaker recognition since they characterize two physiologically distinct components of speech production. The complementary contributions of MFCC and WOCOR in speaker identification are investigated. A confidence measure based score-level fusion technique is proposed to take full advantage of these two complementary features for speaker identification. Experiments show that an identification system using both MFCC and WOCOR significantly outperforms one using MFCC only. In comparison with the identification error rate of 6.8% obtained with MFCC-based system, an error rate of 4.1% is obtained with the proposed confidence measure based integrating system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Integration of Complementary Acoustic Features for Speaker Recognition
Nengheng Zheng ... Tan Lee
IEEE Signal Processing Letters | VOL. 14
Nengheng Zheng, et. al.Nengheng Zheng ... Tan Lee
01 Mar 2007
IEEE Signal Processing Letters | VOL. 14

Speaker Verification Using Complementary Information from Vocal Source and Vocal Tract
Nengheng Zheng ... Ning Wang
-
Nengheng Zheng, et. al.Nengheng Zheng ... Ning Wang
01 Jan 2006
01 Jan 2006

Time -frequency analysis of vocal source signal for speaker recognition
Nengheng Zheng ... Tan Lee
-
Nengheng Zheng, et. al.Nengheng Zheng ... Tan Lee
04 Oct 2004
04 Oct 2004

Combining evidences from excitation source and vocal tract system features for Indian language identification using deep neural networks
Mounika Kamsali Veera ... Suryakanth V Gangashetty
International Journal of Speech Technology | VOL. 21
Mounika Kamsali Veera, et. al.Mounika Kamsali Veera ... Suryakanth V Gangashetty
12 Dec 2017
International Journal of Speech Technology | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrating Complementary Features from Vocal Source and Vocal Tract for Speaker Identification

Abstract

Talk to us

Similar Papers