On integrating a language model into neural machine translation

Caglar Gulcehre,Orhan Firat,Kelvin Xu,Kyunghyun Cho,Yoshua Bengio

doi:10.1016/j.csl.2017.01.014

Abstract

Recent advances in end-to-end neural machine translation models have achieved promising results on high-resource language pairs such as En→ Fr and En→ De. One of the major factor behind these successes is the availability of high quality parallel corpora. We explore two strategies on leveraging abundant amount of monolingual data for neural machine translation. We observe improvements by both combining scores from neural language model trained only on target monolingual data with neural machine translation model and fusing hidden-states of these two models. We obtain up to 2 BLEU improvement over hierarchical and phrase-based baseline on low-resource language pair, Turkish→ English. Our method was initially motivated towards tasks with less parallel data, but we also show that it extends to high resource languages such as Cs→ En and De→ En translation tasks, where we obtain 0.39 and 0.47 BLEU improvements over the neural machine translation baselines, respectively.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On integrating a language model into neural machine translation

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Mar 15, 2017
Citations: 105

Similar Papers

Hierarchical Transfer Learning Architecture for Low-Resource Neural Machine Translation
Gongxu Luo ... Zhanheng Chen
IEEE Access | VOL. 7
Gongxu Luo, et. al.Gongxu Luo ... Zhanheng Chen
01 Jan 2019
IEEE Access | VOL. 7

Prevent the Language Model from being Overconfident in Neural Machine Translation
...
-
, et. al. ...
01 Aug 2021
01 Aug 2021

Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich ... Alexandra Birch
-
Rico Sennrich, et. al.Rico Sennrich ... Alexandra Birch
01 Jan 2015
01 Jan 2015

Translation Transformers Rediscover Inherent Data Domains
...
-
, et. al. ...
21 Oct 2021
21 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On integrating a language model into neural machine translation

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language