Abstract

Fully Attentional Networks (FAN) such as the Transformer (Vaswani et al. 2017) have shown superior results in Neural Machine Translation (NMT) and have become a solid baseline for translation tasks. More recent studies have also reported experimental results showing that additional context sentences improve the translation quality of NMT models (Voita et al. 2018; Müller et al. 2018; Zhang et al. 2018). However, those studies exploit multiple context sentences as a single long concatenated sentence, which may cause the models to suffer from inefficient computational complexity and long-range dependency problems. In this paper, we propose the Hierarchical Context Encoder (HCE), which exploits multiple context sentences separately through a hierarchical FAN structure. Our proposed encoder first abstracts sentence-level information from preceding sentences in a self-attentive way, and then hierarchically encodes context-level information. Through extensive experiments, we observe that our HCE achieves the best performance measured in BLEU score on English-German, English-Turkish, and English-Korean corpora. In addition, our HCE records the best performance on a crowd-sourced test set designed to evaluate how well an encoder can exploit contextual information. Finally, evaluation on an English-Korean pronoun resolution test suite also shows that our HCE can properly exploit contextual information.
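
The following is a minimal, self-contained sketch of the two-level encoding idea described above: each preceding sentence is first encoded with self-attention and pooled into a sentence vector, and the resulting sentence vectors are then encoded again at the context level. All module and parameter names (e.g. HierarchicalContextEncoder, d_model, the mean-pooling step) are illustrative assumptions, not the authors' implementation.

```python
# Hedged PyTorch sketch of a hierarchical context encoder.
# Hypothetical names and pooling choice; not the paper's exact architecture.
import torch
import torch.nn as nn

class HierarchicalContextEncoder(nn.Module):
    def __init__(self, d_model=512, nhead=8, num_layers=2):
        super().__init__()
        # Sentence-level encoder: self-attention within each context sentence.
        sent_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.sentence_encoder = nn.TransformerEncoder(sent_layer, num_layers)
        # Context-level encoder: self-attention over the sentence summaries.
        ctx_layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.context_encoder = nn.TransformerEncoder(ctx_layer, num_layers)

    def forward(self, context_embeddings):
        # context_embeddings: (batch, num_sentences, sent_len, d_model)
        b, n, l, d = context_embeddings.shape
        # Encode every context sentence independently at the token level.
        tokens = context_embeddings.view(b * n, l, d)
        encoded = self.sentence_encoder(tokens)
        # Mean-pool tokens into one vector per sentence (one simple way to
        # "abstract sentence-level information"; the paper's pooling may differ).
        sent_vectors = encoded.mean(dim=1).view(b, n, d)
        # Hierarchically encode context-level information across sentences.
        return self.context_encoder(sent_vectors)

# Usage example: 3 preceding sentences of 20 tokens each, batch size 2.
hce = HierarchicalContextEncoder()
ctx = torch.randn(2, 3, 20, 512)
print(hce(ctx).shape)  # torch.Size([2, 3, 512])
```

In this sketch, the context-level output could then be attended to by the translation decoder alongside the source sentence; how that integration is done is a separate design choice not covered by the abstract.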
