A Novel Approach for Extraction of Features from LP Residual in Time-Domain for Speaker Recognition

A Govardhan,G Chenchamma

doi:10.5120/7605-0611

Abstract

The objective is to model the dominating speakerspecific source in the time-domain at different levels, namely, Subsegmental, segmental and supra-segmental. The speaker-specific source information contained in the LP residual. Hence, LP residual contains different speaker-specific information at different levels. At each level features are extracted using proposed method called Hidden Markov models (HMM) and it is compared with existing Gaussian Mixture model (GMM). The experimental results demonstrates that the performance of Subsegmental level is more than the other two levels. However, the evidences from all the three levels of processing seem to be different and combine well to provide improved performance than the state-of –art speaker recognition system and demonstrating different speaker information captured at each level of processing. Finally, the combined evidence from all the three levels of processing together with vocal tract information further improves the speaker recognition performance. Experiments were conducted on TIMIT database using Gaussian Mixture Models (GMM’s) and Hidden Markov models (HMM’s). Comparing both results the proposed model HMM is better than the existing model GMM.

Full Text