Abstract

End-to-end architectures have shown outstanding performance in speech recognition, but achieving that performance typically requires large amounts of annotated data. For resource-rich languages with ample corpora, satisfactory recognition results have been achieved; for low-resource languages, however, the scarcity of training data remains a bottleneck in building speech recognition systems. This paper presents an approach that uses self-supervised feature extraction and transfer learning to improve acoustic models for low-resource languages. The proposed strategy retrains a base acoustic model, originally trained on resource-rich languages, with a limited amount of low-resource speech data within an end-to-end architecture, yielding an improved acoustic model tailored to the target low-resource language. On a Tibetan dataset, the model shows a significant improvement, reducing the word error rate on the test set from 13.8% to 10.2%, a relative reduction of 26%.
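
To make the described strategy concrete, the sketch below shows one common way to realize this kind of transfer learning in PyTorch: a self-supervised wav2vec 2.0-style encoder (here the publicly available facebook/wav2vec2-large-xlsr-53 checkpoint, chosen purely for illustration) is reused as the feature extractor, its convolutional front end is frozen, and a new CTC output head is fine-tuned on a small amount of target-language data. The vocabulary size, tensors, and single training step are hypothetical placeholders, not the authors' exact implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F
from transformers import Wav2Vec2Model

class LowResourceCTCModel(nn.Module):
    """Pretrained self-supervised encoder plus a new CTC head for the target language."""
    def __init__(self, vocab_size, pretrained_name="facebook/wav2vec2-large-xlsr-53"):
        super().__init__()
        self.encoder = Wav2Vec2Model.from_pretrained(pretrained_name)
        # Freeze the convolutional feature extractor; only the transformer
        # layers and the new output head are adapted on the small corpus.
        for p in self.encoder.feature_extractor.parameters():
            p.requires_grad = False
        self.head = nn.Linear(self.encoder.config.hidden_size, vocab_size)

    def forward(self, waveforms):
        hidden = self.encoder(waveforms).last_hidden_state   # (batch, frames, hidden)
        return self.head(hidden).log_softmax(dim=-1)         # CTC log-probabilities

# Hypothetical fine-tuning step on one small labelled batch.
model = LowResourceCTCModel(vocab_size=64)        # 64 = placeholder target vocabulary size
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

waveforms = torch.randn(2, 16000)                 # stand-in for 1 s of 16 kHz audio
targets = torch.randint(1, 64, (2, 12))           # stand-in label sequences
log_probs = model(waveforms)                      # (batch, frames, vocab)
input_lengths = torch.full((2,), log_probs.size(1), dtype=torch.long)
target_lengths = torch.full((2,), 12, dtype=torch.long)

# CTC loss expects (frames, batch, vocab); blank index 0 is a common convention.
loss = F.ctc_loss(log_probs.transpose(0, 1), targets,
                  input_lengths, target_lengths, blank=0)
loss.backward()
optimizer.step()

Freezing the low-level feature extractor while updating the higher transformer layers is a common design choice in this setting: it keeps the limited target-language data from overwriting the general acoustic representations learned from resource-rich speech.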
