Abstract

Even though the linguistics-free sequence-to-sequence model in neural machine translation (NMT) has a certain capability of implicitly learning syntactic information from source sentences, this paper shows that source syntax can be explicitly and effectively incorporated into NMT to provide further improvements. Specifically, we linearize parse trees of source sentences to obtain structural label sequences. On this basis, we propose three different kinds of encoders to incorporate source syntax into NMT: 1) a Parallel RNN encoder that learns word and label annotation vectors in parallel; 2) a Hierarchical RNN encoder that learns word and label annotation vectors in a two-level hierarchy; and 3) a Mixed RNN encoder that learns word and label annotation vectors over sequences in which words and labels are stitched together. Experimentation on Chinese-to-English translation demonstrates that all three proposed syntactic encoders improve translation accuracy. Interestingly, the simplest of them, the Mixed RNN encoder, yields the best performance, with a significant improvement of 1.4 BLEU points. Moreover, an in-depth analysis from several perspectives is provided to reveal how source syntax benefits NMT.
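
To make the tree-linearization idea concrete, the sketch below (our illustration, not the authors' released code) shows one plausible way to flatten a source parse tree into a structural label sequence and to interleave words with labels as a Mixed-RNN-style encoder input. The nested-tuple tree format and the "(NP" / "NP)" label tokens are assumptions made for illustration, not the paper's exact scheme.

```python
# Minimal sketch: linearizing a parse tree into structural label tokens,
# assuming a tree is a (label, children) tuple and a leaf is a (word, []) tuple.

def linearize(tree):
    """Depth-first traversal emitting opening/closing structural labels
    interleaved with source words (a Mixed-RNN-style input sequence)."""
    label, children = tree
    if not children:                 # leaf: a source word
        return [label]
    seq = ["(" + label]              # opening label token, e.g. "(NP"
    for child in children:
        seq += linearize(child)
    seq.append(label + ")")          # closing label token, e.g. "NP)"
    return seq

def labels_only(tree):
    """Structural label sequence with the words stripped out, as a separate
    label stream for Parallel/Hierarchical-style encoders."""
    return [t for t in linearize(tree) if t.startswith("(") or t.endswith(")")]

# Toy parse of "Tokyo exchange approves listing":
#   (S (NP Tokyo exchange) (VP approves (NP listing)))
word = lambda w: (w, [])
tree = ("S", [("NP", [word("Tokyo"), word("exchange")]),
              ("VP", [word("approves"), ("NP", [word("listing")])])])

print(linearize(tree))
# ['(S', '(NP', 'Tokyo', 'exchange', 'NP)', '(VP', 'approves',
#  '(NP', 'listing', 'NP)', 'VP)', 'S)']
print(labels_only(tree))
# ['(S', '(NP', 'NP)', '(VP', '(NP', 'NP)', 'VP)', 'S)']
```

Under these assumptions, the label-only sequence would feed the label RNN of the Parallel and Hierarchical encoders, while the interleaved word-and-label sequence corresponds to the single mixed input stream of the Mixed RNN encoder.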

Highlights

  • The sequence to sequence model in neural machine translation (NMT) has achieved certain success over the state-of-the-art of statistical machine translation (SMT)

[Figure (a): an example with source pattern NP1 VV NP2; NMT output: "tokoyo stock exchange approves new listing bank"; reference: "tokyo exchange approves shinsei bank 's application for listing"]

  • We have investigated whether and how source syntax can explicitly help NMT to improve its translation accuracy

  • Experimentation on Chinese-to-English translation shows that all proposed models yield improvements over a state-of-the-art baseline NMT system


Summary

Introduction

The sequence to sequence model (seq2seq) in neural machine translation (NMT) has achieved certain success over the state-of-the-art of statistical machine translation (SMT). Shi et al. (2016) show that the seq2seq model still fails to capture many deep structural details, even though it is capable of learning certain implicit source syntax from a sentence-aligned parallel corpus; an additional parsing-task-specific training mechanism is required to recover the hidden syntax in NMT. In the absence of explicit linguistic knowledge, the seq2seq model in NMT tends to produce translations that fail to respect syntax well. Statistics on our development set show that one fourth of Chinese noun phrases are translated into discontinuous phrases in English, indicating a substantial disrespect of syntax in NMT translation. Experimentation on Chinese-to-English translation demonstrates that all proposed approaches are able to improve translation accuracy.

Attention-based NMT
NMT with Source Syntax
Syntax Representation
RNN Encoders with Source Syntax
Comparison of RNN Encoders with Source Syntax
Experimental Settings
Experiment Results
Effects on Long Sentences
Analysis on Word Alignment
Analysis on Phrase Alignment
Analysis on Over Translation
Analysis on Rare Word Translation
Related Work
Conclusion

