Abstract

Over the last few years two promising research directions in low-resource neural machine translation (NMT) have emerged. The first focuses on utilizing high-resource languages to improve the quality of low-resource languages via multilingual NMT. The second direction employs monolingual data with self-supervision to pre-train translation models, followed by fine-tuning on small amounts of supervised data. In this work, we join these two lines of research and demonstrate the efficacy of monolingual data with self-supervision in multilingual NMT. We offer three major results: (i) Using monolingual data significantly boosts the translation quality of low-resource languages in multilingual models. (ii) Self-supervision improves zero-shot translation quality in multilingual models. (iii) Leveraging monolingual data with self-supervision provides a viable path towards adding new languages to multilingual models, getting up to 33 BLEU on ro-en translation without any parallel data or back-translation.
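The self-supervision referred to here is the MASS objective (masked sequence-to-sequence pre-training), which the paper adapts to the multilingual setting (see "Adapting MASS for multilingual models" in the outline below). As a rough, hypothetical illustration of the idea, the Python sketch below builds a single MASS-style training example from a monolingual sentence; the function name mass_mask, the masking ratio, and the example sentence are illustrative assumptions, not the authors' implementation.

    import random

    MASK = "[MASK]"

    def mass_mask(tokens, mask_ratio=0.5, seed=None):
        """Build one MASS-style (masked sequence-to-sequence) example.

        A contiguous span covering roughly `mask_ratio` of the sentence is
        replaced by [MASK] tokens on the encoder side, and the decoder is
        trained to reconstruct exactly that span. Simplified sketch, not the
        paper's exact masking scheme.
        """
        rng = random.Random(seed)
        span_len = max(1, round(len(tokens) * mask_ratio))
        start = rng.randint(0, len(tokens) - span_len)
        encoder_input = tokens[:start] + [MASK] * span_len + tokens[start + span_len:]
        decoder_target = tokens[start:start + span_len]
        return encoder_input, decoder_target

    # Monolingual Romanian sentence; in the multilingual setting a target-language
    # token would also be prepended to the input (Johnson et al., 2017).
    sentence = ["aceasta", "este", "o", "propoziție", "simplă"]
    enc_in, dec_out = mass_mask(sentence, seed=1)
    print(enc_in)   # encoder input with a contiguous masked span, e.g. [..., '[MASK]', '[MASK]', ...]
    print(dec_out)  # the masked span the decoder must reconstruct

Examples of this form, drawn from monolingual corpora, can be mixed into the same training batches as parallel sentence pairs, which is how monolingual data enters the multilingual model here.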

Highlights

  • Recent work has demonstrated the efficacy of multilingual neural machine translation in improving the translation quality of low-resource languages (Firat et al., 2016; Aharoni et al., 2019) as well as zero-shot translation (Ha et al., 2016; Johnson et al., 2017; Arivazhagan et al., 2019b).

  • The most interesting aspect of this work is that we introduce a path towards effectively adding new, unseen languages to a multilingual neural machine translation (NMT) model, showing strong translation quality on several language pairs by leveraging only monolingual data with self-supervised learning, without the need for any parallel data for the new languages.

  • Low-Resource Translation: From Figure 2, we observe that our supervised multilingual NMT model significantly improves translation quality for most low- and medium-resource languages compared with the bilingual baselines.


Summary

Introduction

Recent work has demonstrated the efficacy of multilingual neural machine translation (multilingual NMT) in improving the translation quality of low-resource languages (Firat et al., 2016; Aharoni et al., 2019) as well as zero-shot translation (Ha et al., 2016; Johnson et al., 2017; Arivazhagan et al., 2019b). The success of multilingual NMT on low-resource languages relies heavily on transfer learning from high-resource languages for which copious amounts of parallel data are accessible. Compared with multilingual models trained without any monolingual data, our approach shows consistent improvements in the translation quality of all languages, with gains of more than 10 BLEU points on certain low-resource languages. The most interesting aspect of this work is that we introduce a path towards effectively adding new, unseen languages to a multilingual NMT model, showing strong translation quality on several language pairs by leveraging only monolingual data with self-supervised learning, without the need for any parallel data for the new languages.

Experimental Setup
Adapting MASS for multilingual models
Datasets
Data Sampling
Architecture and Optimization
Using Monolingual Data for Multilingual NMT
Adding New Languages to Multilingual NMT
Related Work
Conclusion and Future Directions
Findings
A Appendices