Abstract

Spoken language identification is the automatic recognition of the language spoken in an utterance. It is commonly used in speech translation systems, multilingual speech recognition, and speaker diarization. In this paper, spoken language identification based on deep learning (DL) and the i-vector paradigm is presented. Specifically, a comparative study is reported, consisting of language identification experiments using deep neural networks (DNN) and convolutional neural networks (CNN); the integration of the two methods into a complete system is also investigated. Previous studies have demonstrated the effectiveness of DNN in spoken language identification; however, to date, the combination of CNN and i-vectors for language identification has not been investigated. The main advantage of a CNN is that it requires fewer parameters than a DNN and is therefore cheaper in terms of memory and computational power. The proposed methods are evaluated on the NIST 2015 i-vector Machine Learning Challenge task of recognizing 50 in-set languages. The DNN achieved an equal error rate (EER) of 3.55%, the CNN achieved an EER of 3.48%, and the fusion of the DNN and CNN systems yielded an EER of 3.3%. These results are very promising and demonstrate the effectiveness of combining CNN and i-vectors in spoken language identification. The proposed methods were also compared to a baseline based on support vector machines (SVM) and demonstrated significantly superior performance.
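
As a rough illustration of the kind of system the abstract describes, the sketch below defines a small fully connected (DNN) classifier and a 1-D CNN classifier over fixed-length i-vectors and fuses their scores by averaging posteriors. All dimensions, layer sizes, and the fusion rule are illustrative assumptions (e.g., 400-dimensional i-vectors), not the configuration reported in the paper; PyTorch is used only for concreteness.

```python
import torch
import torch.nn as nn

IVEC_DIM = 400      # assumed i-vector dimensionality (illustrative, not from the paper)
NUM_LANGS = 50      # 50 in-set languages, as in the NIST 2015 challenge

class DNNClassifier(nn.Module):
    """Fully connected network over a raw i-vector (illustrative layer sizes)."""
    def __init__(self, in_dim=IVEC_DIM, num_classes=NUM_LANGS):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, 512), nn.ReLU(),
            nn.Linear(512, 512), nn.ReLU(),
            nn.Linear(512, num_classes),
        )

    def forward(self, x):                    # x: (batch, IVEC_DIM)
        return self.net(x)

class CNNClassifier(nn.Module):
    """1-D convolutions over the i-vector coefficients; fewer weights than the DNN."""
    def __init__(self, in_dim=IVEC_DIM, num_classes=NUM_LANGS):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=7, padding=3), nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2), nn.ReLU(),
            nn.MaxPool1d(2),
        )
        self.fc = nn.Linear(32 * (in_dim // 4), num_classes)

    def forward(self, x):                    # x: (batch, IVEC_DIM)
        h = self.conv(x.unsqueeze(1))        # -> (batch, 32, IVEC_DIM // 4)
        return self.fc(h.flatten(1))

def fuse_scores(dnn_logits, cnn_logits):
    """Simple score-level fusion: average the per-language posteriors."""
    return (torch.softmax(dnn_logits, dim=-1) +
            torch.softmax(cnn_logits, dim=-1)) / 2

# Example: score a batch of 8 i-vectors with both models and fuse the outputs.
if __name__ == "__main__":
    ivectors = torch.randn(8, IVEC_DIM)
    dnn, cnn = DNNClassifier(), CNNClassifier()
    fused = fuse_scores(dnn(ivectors), cnn(ivectors))
    print(fused.argmax(dim=-1))              # predicted language index per utterance
```

In practice the models would be trained separately on labeled i-vectors and the fusion weights could be tuned on a development set; the equal-weight average above is only the simplest possible choice.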
