Abstract

In neural machine translation, a source sequence of words is encoded into a vector from which a target sequence is generated in the decoding phase. Unlike in statistical machine translation, the associations between source words and their possible target counterparts are not stored explicitly. Source and target words sit at the two ends of a long information processing procedure, mediated by hidden states at both the source encoding and the target decoding phases. As a result, a source word may be incorrectly translated into a target word that is not among its admissible equivalents in the target language. In this paper, we seek to shorten the distance between source and target words in that procedure, and thus strengthen their association, by means of a method we term bridging source and target word embeddings. We experiment with three strategies: (1) a source-side bridging model, where source word embeddings are moved one step closer to the output target sequence; (2) a target-side bridging model, which exploits the most relevant source word embeddings for the prediction of the target sequence; and (3) a direct bridging model, which directly connects source and target word embeddings, seeking to minimize translation errors between them. Experiments and analysis presented in this paper demonstrate that the proposed bridging models significantly improve the quality of both sentence translation, in general, and the alignment and translation of individual source words with target words, in particular.
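As a rough illustration of the third strategy, the sketch below (plain NumPy, not the authors' code; the transform matrix W, the dimensions, and the loss weight lam are all hypothetical) adds an auxiliary penalty that pulls the embedding of each predicted target word toward a transform of the attention-weighted source word embeddings:

    import numpy as np

    rng = np.random.default_rng(0)
    d_src, d_tgt, src_len = 64, 64, 7          # hypothetical sizes
    E_src = rng.normal(size=(src_len, d_src))  # source word embeddings x_1..x_n
    e_tgt = rng.normal(size=d_tgt)             # embedding of the target word y_t
    alpha = rng.random(src_len)
    alpha /= alpha.sum()                       # attention weights at step t (sum to 1)
    W = 0.1 * rng.normal(size=(d_tgt, d_src))  # learned transform (hypothetical)

    # Attention-weighted source embedding: the source-side counterpart of y_t.
    ctx_emb = alpha @ E_src

    # Auxiliary bridging penalty: squared distance between the target word
    # embedding and the transformed attended source embeddings.
    bridge_loss = np.sum((e_tgt - W @ ctx_emb) ** 2)

    # In training this would be added to the usual likelihood objective:
    # loss = nll_loss + lam * bridge_loss
    print(f"bridging penalty at step t: {bridge_loss:.3f}")

The penalty vanishes exactly when the transformed attended source embedding coincides with the target word embedding, so minimizing it ties the two ends of the pipeline directly together.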

Highlights

  • Neural machine translation (NMT) is an end-to-end approach to machine translation that has achieved competitive results vis-à-vis statistical machine translation (SMT) on various language pairs (Bahdanau et al., 2015; Cho et al., 2014; Sutskever et al., 2014; Luong and Manning, 2015)

  • To address the problem illustrated above, we seek to shorten the distance between source and target word embeddings within the seq2seq NMT information processing procedure

  • Having presented above three different methods to bridge source and target word embeddings, in the present section we report on a series of experiments on Chinese-to-English translation undertaken to assess the effectiveness of those bridging methods

Summary

Introduction

Neural machine translation (NMT) is an end-to-end approach to machine translation that has achieved competitive results vis-à-vis statistical machine translation (SMT) on various language pairs (Bahdanau et al., 2015; Cho et al., 2014; Sutskever et al., 2014; Luong and Manning, 2015). The NMT seq2seq model incorrectly aligns the target-side end-of-sentence mark eos to 下旬/late with a high attention weight (0.80 in this example), owing to its failure to appropriately capture the similarity, or the lack of it, between the source word 下旬/late and the target eos. To address this problem, we seek to shorten the distance between source and target word embeddings within the seq2seq NMT information processing procedure. We term this method bridging; it can be conceived as focusing the attention mechanism on more translation-plausible alignments between source and target words. Experiments on Chinese-English translation, with extensive analysis, demonstrate that directly bridging the word embeddings at the two ends produces better word alignments and achieves better translation quality
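To make the alignment behaviour concrete, here is a toy example (illustrative only; the attention matrix below is fabricated to mirror the error described above, not taken from the paper) of reading word alignments off attention weights by linking each target word to the source word receiving the highest weight:

    import numpy as np

    src = ["今年", "下旬", "<eos>"]           # glosses: this-year, late, end of sentence
    tgt = ["late", "this", "year", "<eos>"]

    # Fabricated attention matrix: rows are target steps, columns source words.
    attn = np.array([
        [0.10, 0.85, 0.05],  # "late"  -> 下旬 (plausible)
        [0.70, 0.20, 0.10],  # "this"  -> 今年 (plausible)
        [0.75, 0.15, 0.10],  # "year"  -> 今年 (plausible)
        [0.15, 0.80, 0.05],  # "<eos>" -> 下旬/late: the misalignment above
    ])

    for t, row in zip(tgt, attn):
        print(f"{t:>6} -> {src[row.argmax()]} (weight {row.max():.2f})")

The bridging models aim to suppress exactly the kind of row shown for <eos>: with source embeddings brought closer to the decoder, the end-of-sentence mark has less incentive to attend to a content word such as 下旬.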

Bridging Models
Source-side Bridging Model
Target-side Bridging Model
Direct Bridging Model
Experiments
Experimental Settings
Experimental Results
Analysis of Word Alignment
Analysis of Long Sentence Translation
Analysis of Over and Under Translation
Analysis of Learned Word Embeddings
Related Work
Conclusion