Spectral feature based automatic tonal and non-tonal language classification

Alice Celin Alphonsa, Azharuddin Laskar, Rabul Hussain Laskar, Chuya China Bhanja

doi:10.1109/icicict1.2017.8342752

Abstract

A Language Identification (LID) system determines the language of a given speech utterance. Languages can be divided into tonal and non-tonal categories according to whether the meaning of a word changes with its pitch pattern. Classifying languages as tonal or non-tonal before the individual language identification stage reduces the complexity of the LID system. Although state-of-the-art systems use prosodic features for this purpose, this work analyses the performance of spectral features for tonal and non-tonal language classification. Four spectral feature sets are compared: Mel Frequency Cepstral Coefficients (MFCC), MFCC with Shifted Delta Cepstral (SDC) coefficients, Mean Hilbert Envelope Coefficients (MHEC), and MHEC with SDC coefficients. Experiments are performed on the Oregon Graduate Institute Multilingual Telephone Speech Corpus (OGI-MLTS) and the NITS Language database using the Gaussian Mixture Model-Universal Background Model (GMM-UBM) modelling technique. Results show that MHEC with SDC and MFCC with SDC features, at the syllabic level, give comparable performance of 33.97% Equal Error Rate (EER) for this classification task.
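The abstract reports its result as an Equal Error Rate, the operating point at which the false acceptance rate and the false rejection rate are equal. The following is a minimal sketch of how an EER could be estimated from two lists of classifier scores by sweeping a decision threshold. It is not the authors' evaluation code: the function name `equal_error_rate`, the acceptance convention (accept when score >= threshold), and the example scores are assumptions made for illustration only.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    target_scores: scores of trials where the hypothesised class is correct.
    nontarget_scores: scores of trials where it is not.
    Returns (eer, threshold) at the point where the false rejection rate
    and the false acceptance rate are closest to each other.
    """
    if not target_scores or not nontarget_scores:
        raise ValueError("both score lists must be non-empty")

    best = None
    for thr in sorted(set(target_scores) | set(nontarget_scores)):
        # Assumed convention: a trial is accepted when its score >= threshold.
        frr = sum(s < thr for s in target_scores) / len(target_scores)
        far = sum(s >= thr for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # Report the midpoint of the two rates at the closest crossing.
            best = (gap, (far + frr) / 2.0, thr)
    return best[1], best[2]


# Made-up scores, purely illustrative (not data from the paper).
tonal_trials = [0.9, 0.8, 0.7, 0.4]
non_tonal_trials = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(tonal_trials, non_tonal_trials)
print(f"EER = {eer:.2%} at threshold {threshold}")
```

With only a handful of trials the two error rates rarely cross exactly, which is why the sketch returns the midpoint at the closest threshold; evaluation toolkits typically interpolate along the error curve instead.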

Similar Papers

GMM based Language Identification using MFCC and SDC Features
Kshirod Sarmah ... Utpal Bhattacharjee
International Journal of Computer Applications | VOL. 85
16 Jan 2014

Effective preprocessing of speech and acoustic features extraction for spoken language identification
Abhijeet Kumar ... S Chaturvedi
01 May 2015

Multilingual Speech Corpus in Low-Resource Eastern and Northeastern Indian Languages for Speaker and Language Identification
Joyanta Basu ... Tapan Kumar Basu
Circuits, Systems, and Signal Processing | VOL. 40
20 Apr 2021

A hierarchical language identification system for Indian languages
S. Jothilakshmi ... V. Ramalingam
Digital Signal Processing | VOL. 22
27 Jan 2012
