Polygon-Net: A General Framework for Jointly Boosting Multiple Unsupervised Neural Machine Translation Models

Chang Xu,Tie-Yan Liu,Gang Wang,Tao Qin

doi:10.24963/ijcai.2019/739

Abstract

Neural machine translation (NMT) has achieved great success. However, collecting large-scale parallel data for training is costly and laborious. Recently, unsupervised neural machine translation has attracted more and more attention, due to its demand for monolingual corpus only, which is common and easy to obtain, and its great potentials for the low-resource or even zero-resource machine translation. In this work, we propose a general framework called Polygon-Net, which leverages multi auxiliary languages for jointly boosting unsupervised neural machine translation models. Specifically, we design a novel loss function for multi-language unsupervised neural machine translation. In addition, different from the literature that just updating one or two models individually, Polygon-Net enables multiple unsupervised models in the framework to update in turn and enhance each other for the first time. In this way, multiple unsupervised translation models are associated with each other for training to achieve better performance. Experiments on the benchmark datasets including UN Corpus and WMT show that our approach significantly improves over the two-language based methods, and achieves better performance with more languages introduced to the framework.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Polygon-Net: A General Framework for Jointly Boosting Multiple Unsupervised Neural Machine Translation Models

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Iterative Training of Unsupervised Neural and Statistical Machine Translation Systems
Benjamin Marie ... Atsushi Fujita
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19
Benjamin Marie, et. al.Benjamin Marie ... Atsushi Fujita
01 Jun 2020
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19

Unsupervised English Intelligent Machine Translation in Wireless Network Environment
Bing Zhang
Security and Communication Networks | VOL. 2022
Bing ZhangBing Zhang
21 May 2022
Security and Communication Networks | VOL. 2022

Unsupervised Pivot Translation for Distant Languages
Yichong Leng ... Tao Qin
-
Yichong Leng, et. al.Yichong Leng ... Tao Qin
01 Jan 2019
01 Jan 2019

Language Model Pre-training Method in Machine Translation Based on Named Entity Recognition
Zhen Li ... Chaojie Xie
International Journal on Artificial Intelligence Tools | VOL. 29
Zhen Li, et. al.Zhen Li ... Chaojie Xie
30 Nov 2020
International Journal on Artificial Intelligence Tools | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Polygon-Net: A General Framework for Jointly Boosting Multiple Unsupervised Neural Machine Translation Models

Abstract

Talk to us

Similar Papers