Abstract

Zero-shot neural machine translation (NMT) is a framework that uses source-pivot and target-pivot parallel data to train a source-target NMT system. An extension to zero-shot NMT is zero-resource NMT, which generates pseudo-parallel corpora using a zero-shot system and further trains the zero-shot system on that data. In this paper, we expand on zero-resource NMT by incorporating monolingual data in the pivot language into training; since the pivot language is usually the highest-resource language of the three, we expect monolingual pivot-language data to be most abundant. We propose methods for generating pseudo-parallel corpora using pivot-language monolingual data and for leveraging the pseudo-parallel corpora to improve the zero-shot NMT system. We evaluate these methods for a high-resource language pair (German-Russian) using English as the pivot. We show that our proposed methods yield consistent improvements over strong zero-shot and zero-resource baselines and even catch up to pivot-based models in BLEU (while not requiring the two-pass inference that pivot models require).

Highlights

  • Neural machine translation (NMT) has achieved impressive results on several high-resource translation tasks (Hassan et al, 2018; Wu et al, 2016)

  • This paper introduced the task of zero-resource neural machine translation using pivot-language monolingual data

  • We introduced a way of generating a pseudo-parallel source↔target training corpus using the monolingual pivot-language corpus, and we showed three ways of leveraging this corpus to train a final source↔target NMT system

Read more

Summary

Introduction

Neural machine translation (NMT) has achieved impressive results on several high-resource translation tasks (Hassan et al, 2018; Wu et al, 2016) These systems have relied on large amounts of parallel training data between the source and the target language; for many language pairs, such data may not be available. Unsupervised NMT systems that learn to translate using only monolingual corpora have been proposed as a solution to this problem (Artetxe et al, 2018; Lample et al, 2018). Such systems do not make full use of available parallel corpora between the source and target languages and a potential pivot language. Pivotbased and zero-shot NMT systems have been proposed as a means of taking advantage of this data to translate between e.g. German and Russian

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call