A Study for Enhancing Low-resource Thai-Myanmar-English Neural Machine Translation

Mya Ei San, Ye Kyaw Thu, Manabu Okumura, Sasiporn Usanavasin

doi:10.1145/3645111

Abstract

Several methodologies have recently been proposed to enhance the performance of low-resource Neural Machine Translation (NMT). However, these techniques have yet to be explored thoroughly for the low-resource Thai and Myanmar languages. Therefore, we first applied augmentation techniques such as SwitchOut and Ciphertext Based Data Augmentation (CipherDAug) to improve NMT performance in these languages. Second, we enhanced NMT performance by fine-tuning the pre-trained Multilingual Denoising BART model (mBART), where BART denotes Bidirectional and Auto-Regressive Transformer. We implemented three NMT systems, namely Transformer+SwitchOut, Multi-Source Transformer+CipherDAug, and fine-tuned mBART, for bidirectional translation of the Thai-English-Myanmar language pairs from the ASEAN-MT corpus. Experimental results showed that Multi-Source Transformer+CipherDAug significantly improved Bilingual Evaluation Understudy (BLEU), Character n-gram F-score (ChrF), and Translation Error Rate (TER) scores over the first baseline, Transformer, and the second baseline, Edit-Based Transformer. The model achieved notable BLEU scores: 37.9 (English-to-Thai), 42.7 (Thai-to-English), 28.9 (English-to-Myanmar), 31.2 (Myanmar-to-English), 25.3 (Thai-to-Myanmar), and 25.5 (Myanmar-to-Thai). The fine-tuned mBART model also considerably outperformed the two baselines, except for the Myanmar-to-English pair. SwitchOut improved over the second baseline in all pairs and performed similarly to the first baseline in most cases. Last, we performed detailed analyses indicating that the CipherDAug and mBART models can help improve low-resource NMT performance for the Thai and Myanmar languages.
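The abstract does not spell out how CipherDAug builds its augmented data. As the technique is generally described, the source side of each parallel sentence is enciphered with a simple rotation (ROT-k) cipher and the enciphered copies are trained on alongside the originals. The following is a minimal sketch of that idea under stated assumptions: the function names `rot_k` and `cipher_augment`, the Latin toy alphabet, and the keys 1 and 2 are all hypothetical choices for illustration and are not taken from the paper.

```python
def rot_k(text: str, k: int, alphabet: str) -> str:
    """Shift every character found in `alphabet` by k positions, wrapping around.

    Characters outside the alphabet (spaces, digits, punctuation) pass through unchanged.
    """
    index = {ch: i for i, ch in enumerate(alphabet)}
    n = len(alphabet)
    return "".join(
        alphabet[(index[ch] + k) % n] if ch in index else ch
        for ch in text
    )


def cipher_augment(pairs, keys, alphabet):
    """Return the original (source, target) pairs plus one enciphered-source copy per key."""
    augmented = list(pairs)
    for k in keys:
        augmented.extend((rot_k(src, k, alphabet), tgt) for src, tgt in pairs)
    return augmented


# Toy example with a Latin alphabet (assumption). A Thai or Myanmar setup
# would pass that script's character inventory instead.
LATIN = "abcdefghijklmnopqrstuvwxyz"
pairs = [("hello world", "TARGET SENTENCE")]
augmented = cipher_augment(pairs, keys=[1, 2], alphabet=LATIN)

for src, tgt in augmented:
    print(src, "->", tgt)
```

Because the cipher is a one-to-one character substitution, each enciphered sentence keeps the word order and length of its original, so the target side can be reused unchanged. The multi-source training setup named in the abstract is not reproduced here.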

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing
Publication Date: Apr 15, 2024
License type: other-oa
