Abstract

Recent work has achieved remarkable results in training neural machine translation (NMT) systems in a fully unsupervised way, using new, dedicated architectures that rely only on monolingual corpora. However, previous work has also shown that unsupervised statistical machine translation (USMT) outperforms unsupervised NMT (UNMT), especially for distant language pairs. To take advantage of the superiority of USMT over UNMT, and considering that SMT suffers from well-known limitations that NMT overcomes, we propose to define UNMT as NMT trained with the supervision of synthetic parallel data generated by USMT. This way, we can exploit USMT up to its limits while ultimately relying on full-fledged NMT models to generate translations. We show significant improvements in translation quality over previous work, and that further gains can be obtained by alternately and iteratively training USMT and UNMT. Since it requires no dedicated architecture for UNMT, our simple approach can straightforwardly benefit from any recent and future advances in supervised NMT. Our systems achieve a new state of the art for unsupervised machine translation on all six of our translation tasks across five diverse language pairs, surpassing even supervised SMT or NMT on some tasks. Furthermore, our analysis shows that the comparability of the monolingual corpora used for unsupervised training is crucial to translation quality.
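To make the pipeline concrete, the following is a minimal Python sketch of the alternating training loop the abstract describes, in a simplified variant where USMT is used only for initialization and later rounds retrain the NMT models on regenerated synthetic data. It is an illustration under assumed interfaces, not the paper's implementation: train_usmt, train_nmt, and the identity "translators" in the toy run are hypothetical placeholders standing in for real toolkits (e.g., a phrase-based SMT system and any supervised NMT framework).

from typing import Callable, List, Tuple

Corpus = List[str]
Bitext = List[Tuple[str, str]]
Translator = Callable[[str], str]

def unsupervised_pipeline(
    mono_src: Corpus,
    mono_tgt: Corpus,
    train_usmt: Callable[[Corpus, Corpus], Translator],  # hypothetical USMT trainer
    train_nmt: Callable[[Bitext], Translator],           # hypothetical supervised NMT trainer
    rounds: int = 3,
) -> Translator:
    # Step 1: train USMT in both directions from monolingual corpora only.
    t2s = train_usmt(mono_tgt, mono_src)  # target -> source
    s2t = train_usmt(mono_src, mono_tgt)  # source -> target
    for _ in range(rounds):
        # Step 2: back-translate each monolingual corpus to build
        # (synthetic source, authentic target) pairs, and vice versa.
        syn_for_s2t: Bitext = [(t2s(t), t) for t in mono_tgt]
        syn_for_t2s: Bitext = [(s2t(s), s) for s in mono_src]
        # Step 3: train standard supervised NMT on the synthetic bitext;
        # later rounds regenerate the data with the latest NMT models.
        s2t, t2s = train_nmt(syn_for_s2t), train_nmt(syn_for_t2s)
    return s2t  # final source -> target model

# Toy run: identity "translators" stand in for real training, just to
# exercise the data flow; real trainers would return learned models.
model = unsupervised_pipeline(
    ["une phrase"], ["a sentence"],
    train_usmt=lambda src, tgt: (lambda x: x),
    train_nmt=lambda bitext: (lambda x: x),
)

The key design point this sketch reflects is that the NMT side is an ordinary supervised model: it only ever sees (synthetic, authentic) sentence pairs, so any advance in supervised NMT can be plugged in unchanged.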
