Abstract

Identification of low-resource languages is a traditionally difficult machine learning problem, since the sparsity of available data prevents classifiers from being adequately trained. An effective way to address this data sparsity in applications such as low-resource spoken language identification is transfer learning, which applies knowledge learned from tasks with large amounts of labeled data to settings with limited data. Motivated by the fact that languages share common phonetic and phonotactic characteristics, we explore transfer learning systems that employ various neural network architectures. We leverage readily available large datasets to build robust language identification models based on feed-forward neural networks, which are then fine-tuned on the low-resource data of a target domain to improve system performance. We apply the proposed approach to the automatic identification of African languages, a challenging task due to the scarcity of data for these languages. We conduct our experiments on two publicly available datasets: the VoxForge corpus, which contains 7 Indo-European languages, as source data, and the Lwazi corpus, which includes 11 African languages, as target data. Our results indicate the effectiveness of transfer learning for identifying low-resource languages from speech signals.
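
The following is a minimal sketch of the pretrain-then-fine-tune recipe the abstract describes: train a feed-forward classifier on the large source corpus, then reuse its hidden layers and fine-tune on the low-resource target languages. The feature dimension, layer sizes, optimizer settings, and the `voxforge_loader`/`lwazi_loader` data loaders are illustrative assumptions, not details taken from the paper.

```python
# Sketch of transfer learning for spoken language identification (PyTorch).
# Assumes fixed-length acoustic feature vectors (e.g., averaged MFCCs);
# all hyperparameters below are hypothetical.
import torch
import torch.nn as nn


class LIDNet(nn.Module):
    """Feed-forward language-identification classifier."""

    def __init__(self, feat_dim: int, num_langs: int, hidden: int = 512):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(feat_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.head = nn.Linear(hidden, num_langs)

    def forward(self, x):
        return self.head(self.body(x))


def train(model, loader, epochs, lr):
    """Standard cross-entropy training loop over (features, label) batches."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for feats, labels in loader:
            opt.zero_grad()
            loss_fn(model(feats), labels).backward()
            opt.step()


# 1) Pretrain on the large source corpus (7 VoxForge languages).
model = LIDNet(feat_dim=39, num_langs=7)
# train(model, voxforge_loader, epochs=20, lr=1e-3)  # hypothetical loader

# 2) Transfer: keep the learned hidden layers, swap in a new output layer
#    for the 11 target languages, and fine-tune on the Lwazi data.
model.head = nn.Linear(512, 11)
# train(model, lwazi_loader, epochs=10, lr=1e-4)     # hypothetical loader
```

A smaller learning rate is used in the fine-tuning step so that the representations learned from the source languages are adapted rather than overwritten; this choice is an assumption consistent with common transfer learning practice rather than a detail stated in the abstract.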
