Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results

Ashkan Moradi,Yasser Shekofteh

doi:10.1016/j.compeleceng.2022.108549

Abstract

Identification of the spoken languages in an audio file is performed automatically using the spoken language identification (LID) process. In this paper, we proposed a genetic-based fusion method to combine the score probabilities of an x-vector-based acoustic LID (ALID) and a phonetic LID (PLID) system. The ALID system is based on an LDA classifier able to identify different languages using x-vectors, while the PLID system is based on an SVM classifier which takes into account perplexities as its feature vector, which are derived from phone language models utilizing a universal phone recognizer named Allosaurus. With the help of genetic-based fusion, 54 weights will be extracted. Having 27 languages in our database and two different LID systems results in 54 weights for our fusion. The individual results of our acoustic and phonetic LID systems are eventually combined by applying these weights. Based on the experimental results on 27 languages from the NIST-LRE09 database, the fusion of the acoustic system and the phonetic system results in 93.30% accuracy, which has approximately a 21% reduction in identification error to our best baseline system with 91.50% accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results

Abstract

Talk to us

Similar Papers

More From: Computers and Electrical Engineering

Lead the way for us

Journal: Computers and Electrical Engineering	Publication Date: Dec 21, 2022
Citations: 1

Similar Papers

Spoken language identification with phonological and lexical models
Shubha Kadambe ... James L. Hieronymus
The Journal of the Acoustical Society of America | VOL. 95
Shubha Kadambe, et. al.Shubha Kadambe ... James L. Hieronymus
01 May 1994
The Journal of the Acoustical Society of America | VOL. 95

Spoken Language Identification with Deep Temporal Neural Network and Multi-levels Discriminative Cues
Linjia Sun
-
Linjia SunLinjia Sun
01 Sep 2020
01 Sep 2020

Statistical language identification based on untranscribed training
M.A Lund ... H Gish
-
M.A Lund, et. al.M.A Lund ... H Gish
07 May 1996
07 May 1996

Language identification with phonological and lexical models
S Kadambe ... J.L Hieronymus
-
S Kadambe, et. al.S Kadambe ... J.L Hieronymus
10 Oct 1995
10 Oct 1995

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spoken language identification using a genetic-based fusion approach to combine acoustic and universal phonetic results

Abstract

Talk to us

Similar Papers

More From: Computers and Electrical Engineering