A comparison between phonetic engine and GMM–UBM classifier for language identification tasks

Sushanta Kabir Dutta,L Joyprakash Singh,Tanvira Ismail

doi:10.1007/s00542-020-04858-x

Abstract

In this paper, two automatic language identification (LID) systems are compared. One of the systems is the Hidden Markov Model (HMM) based phonetic engine (PE), and the other is the Gaussian Mixture Model based Universal Background Model (GMM–UBM) classifier. The PE belongs to the category of explicit LID systems while the GMM–UBM classifier falls into the category of implicit LID systems. Ideally, explicit LID requires a segmented and phonetically labelled speech corpus, while the implicit LID systems do not require any phonetic labelling of the data. Both systems are tested here in identifying a set of data belonging to three Indian languages, Manipuri, Assamese and Bengali. The selection of these languages is made due to their wide range of usages in North Eastern India, while at the same time; no proper identification task has been reported so far for a database containing these languages together. The purpose of this comparison is to check the LID efficiency of the relatively new concept of PE with a prevalent identification technique GMM–UBM. In the experiments, it is found that the identification rate (IDR) is more with the PE than that of GMM–UBM system. The average IDR reported with PE is 99% while for the GMM–UBM system it is found to be 96.94% with the same speech corpus being in use. However, the data preparation task is a little more cumbersome and expensive in PE than that of GMM–UBM system. Thus, the compensation for accuracy may be paid with the cost incurred in data preparation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A comparison between phonetic engine and GMM–UBM classifier for language identification tasks

Abstract

Talk to us

Similar Papers

More From: Microsystem Technologies

Lead the way for us

Similar Papers

An Analysis of Automatic Phone Recognition and Identification of a Few Languages from North Eastern India
Sushanta Kabir Dutta ... Lairenlapam Joyprakash Singh
Indian Journal of Science and Technology | VOL. 10
Sushanta Kabir Dutta, et. al.Sushanta Kabir Dutta ... Lairenlapam Joyprakash Singh
01 Feb 2017
Indian Journal of Science and Technology | VOL. 10

Some Issues Related to Phone Recognition and Language Identification Using Phonetic Engine
Sushanta Kabir Dutta ... L Joyprakash Singh
-
Sushanta Kabir Dutta, et. al.Sushanta Kabir Dutta ... L Joyprakash Singh
01 Jan 2018
01 Jan 2018

Optimal prosodic feature extraction and classification in parametric excitation source information for Indian language identification using neural network based Q-learning algorithm
Himanish Shekhar Das ... Pinki Roy
International Journal of Speech Technology | VOL. 22
Himanish Shekhar Das, et. al.Himanish Shekhar Das ... Pinki Roy
03 Dec 2018
International Journal of Speech Technology | VOL. 22

A Pre-classification-Based Language Identification for Northeast Indian Languages Using Prosody and Spectral Features
Chuya China Bhanja ... Rabul Hussain Laskar
Circuits, Systems, and Signal Processing | VOL. 38
Chuya China Bhanja, et. al.Chuya China Bhanja ... Rabul Hussain Laskar
12 Oct 2018
Circuits, Systems, and Signal Processing | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A comparison between phonetic engine and GMM–UBM classifier for language identification tasks

Abstract

Talk to us

Similar Papers

More From: Microsystem Technologies