Fine-Tuning Self-Supervised Multilingual Sequence-To-Sequence Models for Extremely Low-Resource NMT

Sarubi Thillainathan,Surangika Ranathunga,Sanath Jayasena

doi:10.1109/mercon52712.2021.9525720

Abstract

Neural Machine Translation (NMT) tends to perform poorly in low-resource language settings due to the scarcity of parallel data. Instead of relying on inadequate parallel corpora, we can take advantage of monolingual data available in abundance. Training a denoising self-supervised multilingual sequence-to-sequence model by noising the available large scale monolingual corpora is one way to utilize monolingual data. For a pair of languages for which monolingual data is available in such a pre-trained multilingual denoising model, the model can be fine-tuned with a smaller amount of parallel data from this language pair. This paper presents fine-tuning self-supervised multilingual sequence-to-sequence pre-trained models for extremely low-resource domain-specific NMT settings. We choose one such pre-trained model: mBART. We are the first to implement and demonstrate the viability of non-English centric complete fine-tuning on multilingual sequence-to-sequence pre-trained models. We select Sinhala, Tamil and English languages to demonstrate fine-tuning on extremely low-resource settings in the domain of official government documents. Experiments show that our fine-tuned mBART model significantly outperforms state-of-the-art Transformer based NMT models in all pairs in all six bilingual directions, where we report a 4.41 BLEU score increase on Tamil→Sinhala and a 2.85 BLUE increase on Sinhala→ Tamil translation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fine-Tuning Self-Supervised Multilingual Sequence-To-Sequence Models for Extremely Low-Resource NMT

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich ... Alexandra Birch
-
Rico Sennrich, et. al.Rico Sennrich ... Alexandra Birch
01 Jan 2015
01 Jan 2015

Pre-Training on Mixed Data for Low-Resource Neural Machine Translation
Wenbo Zhang ... Yating Yang
Information | VOL. 12
Wenbo Zhang, et. al.Wenbo Zhang ... Yating Yang
18 Mar 2021
Information | VOL. 12

Spanish-Turkish Low-Resource Machine Translation: Unsupervised Learning vs Round-Tripping
Tianyi Xu ... Shannon Marks
American Journal of Artificial Intelligence | VOL. 4
Tianyi Xu, et. al.Tianyi Xu ... Shannon Marks
01 Jan 2020
American Journal of Artificial Intelligence | VOL. 4

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation
Aditya Siddhant ... Ankur Bapna
-
Aditya Siddhant, et. al.Aditya Siddhant ... Ankur Bapna
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fine-Tuning Self-Supervised Multilingual Sequence-To-Sequence Models for Extremely Low-Resource NMT

Abstract

Talk to us

Similar Papers