GMM based Language Identification using MFCC and SDC Features

Kshirod Sarmah, Utpal Bhattacharjee

doi:10.5120/14840-3103

Abstract

Language Identification (LID) is one of the most popular areas of research in speech signal processing. Many approaches have been used to improve the performance of LID systems, including Parallel Phone Recognition Language Modeling (PPRLM), Support Vector Machines (SVM) and Gaussian Mixture Models (GMM). State-of-the-art LID systems have used a range of feature vectors, such as LPCC, MFCC, SDC and prosodic features. Although fusing prosodic features with MFCC features yields some improvement in LID performance, it is still not sufficient. In this paper, a baseline LID system for multilingual environments is developed using a GMM as the classifier and MFCC combined with Shifted Delta Cepstral (SDC) features as the front-end feature vectors. In this work, we used the Arunachali Language Speech Database (ALS-DB), a multilingual and multichannel speech corpus recently collected from four local languages of Arunachal Pradesh, namely Adi, Apatani, Galo and Nyishi, with Hindi and English as secondary languages. Combining MFCC and SDC features improves the performance of the LID system over either feature set used alone. The minimum equal error rates (EER) for MFCC and SDC individually are 19.70% and 11.83% respectively, while the minimum EER for the combined MFCC and SDC features is 6.40%. The combined features thus improve the performance of the LID system by approximately 15.00% and 6.00% over the baseline systems that use MFCC and SDC features individually, respectively.
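The abstract does not state the SDC configuration or how the MFCC and SDC streams were combined. As a rough illustration only, the sketch below assumes the commonly used d-P-k formulation of SDC (each of k blocks is the difference between the cepstral frames at t + iP + d and t + iP - d) and assumes frame-level concatenation of the two feature streams. The function names, the default values d=1, P=3, k=7, and the edge-clamping at utterance boundaries are illustrative assumptions, not details taken from the paper.

```python
def shifted_delta_cepstra(cepstra, d=1, P=3, k=7):
    """Compute SDC vectors from a sequence of cepstral frames.

    cepstra: list of frames, each a list of N cepstral coefficients.
    d: delta spread, P: shift between blocks, k: number of blocks.
    Returns one SDC vector of length N * k per input frame.
    Defaults are an assumed, commonly cited setting, not the paper's.
    """
    T = len(cepstra)
    if T == 0:
        return []

    def frame(t):
        # Assumption: clamp out-of-range indices to the first/last frame.
        return cepstra[min(max(t, 0), T - 1)]

    sdc = []
    for t in range(T):
        vec = []
        for i in range(k):
            ahead = frame(t + i * P + d)
            behind = frame(t + i * P - d)
            # Block i: delta computed around the frame shifted by i * P.
            vec.extend(a - b for a, b in zip(ahead, behind))
        sdc.append(vec)
    return sdc


def combine_features(mfcc, sdc):
    """Concatenate MFCC and SDC vectors frame by frame (assumed fusion)."""
    return [list(m) + list(s) for m, s in zip(mfcc, sdc)]


# Toy input: 30 frames of 2 synthetic "cepstral" coefficients.
toy_mfcc = [[float(t), 2.0 * t] for t in range(30)]
toy_sdc = shifted_delta_cepstra(toy_mfcc, d=1, P=3, k=7)
toy_combined = combine_features(toy_mfcc, toy_sdc)
```

In a GMM-based system of the kind described, vectors like `toy_combined` would be used to train one mixture model per language, with a test utterance assigned to the language whose model gives the highest likelihood.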


Journal: International Journal of Computer Applications
Publication Date: Jan 16, 2014
Citations: 15

Similar Papers

Effective preprocessing of speech and acoustic features extraction for spoken language identification
Abhijeet Kumar ... S Chaturvedi
01 May 2015

Spectral feature based automatic tonal and non-tonal language classification
Alice Celin Alphonsa ... Chuya China Bhanja
01 Jul 2017

Language Identification based on Support Vector Machine using GMM Super vectors
Dr A Nagesh
International Journal of Innovative Technology and Exploring Engineering | VOL. 9
30 Apr 2020

Evaluation of Lineal Relation between Shifted Delta Cepstral Features and Prosodic Features in Speaker Verification
José R Calvo ... Gabriel Hernández
01 Jan 2008
