Consistency by Agreement in Zero-Shot Neural Machine Translation

Maruan Al-Shedivat,Ankur Parikh

doi:10.18653/v1/n19-1121

Abstract

Generalization and reliability of multilingual translation often highly depend on the amount of available parallel data for each language pair of interest. In this paper, we focus on zero-shot generalization—a challenging setup that tests models on translation directions they have not been optimized for at training time. To solve the problem, we (i) reformulate multilingual translation as probabilistic inference, (ii) define the notion of zero-shot consistency and show why standard training often results in models unsuitable for zero-shot tasks, and (iii) introduce a consistent agreement-based training method that encourages the model to produce equivalent translations of parallel sentences in auxiliary languages. We test our multilingual NMT models on multiple public zero-shot translation benchmarks (IWSLT17, UN corpus, Europarl) and show that agreement-based learning often results in 2-3 BLEU zero-shot improvement over strong baselines without any loss in performance on supervised translation directions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Consistency by Agreement in Zero-Shot Neural Machine Translation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Improving Multilingual Neural Machine Translation with Auxiliary Source Languages
Weijia Xu ... Shuming Ma
-
Weijia Xu, et. al.Weijia Xu ... Shuming Ma
01 Jan 2020
01 Jan 2020

Improving Multilingual Neural Machine Translation with Auxiliary Source Languages

-

23 Oct 2021
23 Oct 2021

Multilingual Neural Translation

-

14 Feb 2020
14 Feb 2020

High-resource Language-specific Training for Multilingual Neural Machine Translation
Jian Yang ... Yuwei Yin
-
Jian Yang, et. al.Jian Yang ... Yuwei Yin
01 Jul 2022
01 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Consistency by Agreement in Zero-Shot Neural Machine Translation

Abstract

Talk to us

Similar Papers