ASR - VLSP 2021: Semi-supervised Ensemble Model for Vietnamese Automatic Speech Recognition

Dang Trung Duc Anh, Dao Dang Huy, Le Duc Cuong, Luu Duc Thanh, Nguyen Duc Tan, Nguyen Thi Thu Trang, Pham Viet Thanh

doi:10.25073/2588-1086/vnucsce.332

Abstract

Automatic speech recognition (ASR) has advanced rapidly with the arrival of end-to-end architectures. Semi-supervised learning methods, which can exploit unlabeled data, have contributed substantially to the success of ASR systems, enabling them to surpass human performance. However, most research focuses on developing these techniques for English speech recognition, which raises concerns about their performance in other languages, especially in low-resource scenarios. In this paper, we propose a Vietnamese ASR system for the VLSP 2021 Automatic Speech Recognition Shared Task. The system is based on the Wav2vec 2.0 framework, combined with self-training and several data augmentation techniques. Experimental results show that on the ASR-T1 test set of the shared task, our proposed model ranked second, with a Syllable Error Rate (SyER) of 11.08%.
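
The abstract reports results as a Syllable Error Rate but does not define how it is computed. As a rough illustration only, the sketch below assumes SyER is calculated like word error rate, with whitespace-separated Vietnamese syllables as the tokens: the total edit distance (substitutions, deletions, insertions) divided by the number of reference syllables. The function names `edit_distance` and `syllable_error_rate` are hypothetical, and the shared task's official scoring script may normalize text differently.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def syllable_error_rate(references, hypotheses):
    """Assumed SyER: total syllable edits / total reference syllables."""
    total_errors = 0
    total_syllables = 0
    for ref, hyp in zip(references, hypotheses):
        # Assumption: syllables are separated by whitespace; lowercase only.
        ref_syllables = ref.lower().split()
        hyp_syllables = hyp.lower().split()
        total_errors += edit_distance(ref_syllables, hyp_syllables)
        total_syllables += len(ref_syllables)
    if total_syllables == 0:
        raise ValueError("references contain no syllables")
    return total_errors / total_syllables


if __name__ == "__main__":
    refs = ["xin chào việt nam"]
    hyps = ["xin chao việt"]  # one substitution, one deletion
    print(f"SyER = {syllable_error_rate(refs, hyps):.2%}")
```

Under this assumption, a reported SyER of 11.08% would correspond to roughly 11 syllable edits per 100 reference syllables.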

Journal: VNU Journal of Science: Computer Science and Communication Engineering
Publication Date: Jun 30, 2022
Citations: 1

Similar Papers

Rapid building of an ASR system for under-resourced languages based on multilingual unsupervised training
Ngoc Thang Vu ... Franziska Kraus
27 Aug 2011

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition
01 Jan 2004

Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling
G Thimmaraja Yadava ... H S Jayanna
International Journal of Speech Technology | VOL. 23
22 Jan 2020

An Investigation of Multilingual TDNN-BLSTM Acoustic Modeling for Hindi Speech Recognition
Ankit Kumar ... Rajesh Kumar Aggarwal
International Journal of Sensors, Wireless Communications and Control | VOL. 12
01 Jan 2021
