IITKGP-MLILSC speech database for language identification

Sudhamay Maity, K Sreenivasa Rao, Anil Kumar Vuppala, Dipanjan Nandi

doi:10.1109/ncc.2012.6176831

Abstract

In this paper, we introduce a speech database covering 27 Indian languages for analyzing the language-specific information present in speech. For Indian languages, a systematic analysis of speech features and classification models for automatic language identification has not been performed, because no suitable speech corpus covers the majority of these languages. With this motivation, we have begun developing a multilingual speech corpus of Indian languages. In this paper, spectral features are explored to investigate the presence of language-specific information. Mel-frequency cepstral coefficients (MFCCs) and linear predictive cepstral coefficients (LPCCs) are used to represent the spectral information. Gaussian mixture models (GMMs) are developed to capture the language-specific information present in the spectral features. The performance of the language identification system is analyzed for speaker-dependent and speaker-independent cases. The recognition performance is observed to be 96% and 45% for the speaker-dependent and speaker-independent cases, respectively.
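
The decision rule implied by the abstract (one model per language, with a test utterance assigned to the language whose model gives the highest likelihood) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a single diagonal-covariance Gaussian per language, which is the one-component special case of a GMM, and it uses synthetic 13-dimensional vectors in place of real MFCC or LPCC frames. The language labels, centres, and function names are all hypothetical.

```python
import math
import random


def fit_diag_gaussian(frames):
    """Estimate per-dimension mean and variance from a list of feature vectors."""
    n = len(frames)
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Floor the variance so the log-likelihood stays finite.
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, 1e-6)
           for d in range(dim)]
    return mean, var


def log_likelihood(frames, model):
    """Sum of per-frame log-densities under a diagonal Gaussian model."""
    mean, var = model
    total = 0.0
    for f in frames:
        for x, m, v in zip(f, mean, var):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total


def identify_language(frames, models):
    """Return the label of the model that maximises the utterance log-likelihood."""
    return max(models, key=lambda lang: log_likelihood(frames, models[lang]))


def synthetic_frames(rng, centre, n_frames, dim=13):
    """Hypothetical stand-in for spectral feature frames (not real MFCCs)."""
    return [[rng.gauss(centre, 1.0) for _ in range(dim)] for _ in range(n_frames)]


rng = random.Random(0)
# Hypothetical languages, each with a different feature centre.
centres = {"lang_a": -2.0, "lang_b": 0.0, "lang_c": 2.0}
models = {lang: fit_diag_gaussian(synthetic_frames(rng, c, 200))
          for lang, c in centres.items()}

test_utterance = synthetic_frames(rng, centres["lang_b"], 50)
predicted = identify_language(test_utterance, models)
print(predicted)
```

A system closer to the one described would replace the single Gaussian with a multi-component mixture trained by expectation-maximisation, and would extract the frames from recorded speech. The maximum-likelihood comparison across language models stays the same.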
