Parametric representation of excitation source information for language identification

Dipanjan Nandi,Debadatta Pati,K Sreenivasa Rao

doi:10.1016/j.csl.2016.05.001

Abstract

In this work, the linear prediction (LP) residual signal has been parameterized to capture the excitation source information for language identification (LID) study. LP residual signal has been processed at three different levels: sub-segmental, segmental and supra-segmental levels to demonstrate different aspects of language-specific excitation source information. Proposed excitation source features have been evaluated on 27 Indian languages from Indian Institute of Technology Kharagpur-Multi Lingual Indian Language Speech Corpus (IITKGP-MLILSC), Oregon Graduate Institute Multi-Language Telephone-based Speech (OGI-MLTS) and National Institute of Standards and Technology Language Recognition Evaluation (NIST LRE) 2011 corpora. LID systems were developed using Gaussian mixture model (GMM) and i-vector based approaches. Experimental results have shown that segmental level parametric features provide better identification accuracy (62%), compared to sub-segmental (40%) and supra-segmental level (34%) features. Excitation source features obtained from three levels show distinct language-specific evidence. Therefore, the scores from all three levels are combined to obtain the complete excitation source information for the LID task. LID performances achieved from both the excitation source and vocal tract system are compared. Finally, the scores obtained by processing the vocal tract and excitation source features are combined to achieve better improvement in LID accuracy. The best recognition accuracies obtained from stage-IV integrated LID systems I, II and III are 69%, 70% and 72% respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parametric representation of excitation source information for language identification

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Jun 16, 2016
Citations: 13

Similar Papers

Implicit excitation source features for robust language identification
Dipanjan Nandi ... Debadatta Pati
International Journal of Speech Technology | VOL. 18
Dipanjan Nandi, et. al.Dipanjan Nandi ... Debadatta Pati
17 Jun 2015
International Journal of Speech Technology | VOL. 18

Sub-segmental, segmental and supra-segmental analysis of linear prediction residual signal for language identification
Dipanjan Nandi ... K Sreenivasa Rao
-
Dipanjan Nandi, et. al.Dipanjan Nandi ... K Sreenivasa Rao
01 Jul 2014
01 Jul 2014

Implicit processing of LP residual for language identification
Dipanjan Nandi ... K Sreenivasa Rao
Computer Speech & Language | VOL. 41
Dipanjan Nandi, et. al.Dipanjan Nandi ... K Sreenivasa Rao
16 Jun 2016
Computer Speech & Language | VOL. 41

Combining evidences from excitation source and vocal tract system features for Indian language identification using deep neural networks
Mounika Kamsali Veera ... Suryakanth V Gangashetty
International Journal of Speech Technology | VOL. 21
Mounika Kamsali Veera, et. al.Mounika Kamsali Veera ... Suryakanth V Gangashetty
12 Dec 2017
International Journal of Speech Technology | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parametric representation of excitation source information for language identification

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language