Tied Transformers: Neural Machine Translation with Shared Encoder and Decoder

Yingce Xia,Xu Tan,Tao Qin,Di He,Tianyu He,Fei Tian

doi:10.1609/aaai.v33i01.33015466

Abstract

Sharing source and target side vocabularies and word embeddings has been a popular practice in neural machine translation (briefly, NMT) for similar languages (e.g., English to French or German translation). The success of such wordlevel sharing motivates us to move one step further: we consider model-level sharing and tie the whole parts of the encoder and decoder of an NMT model. We share the encoder and decoder of Transformer (Vaswani et al. 2017), the state-of-the-art NMT model, and obtain a compact model named Tied Transformer. Experimental results demonstrate that such a simple method works well for both similar and dissimilar language pairs. We empirically verify our framework for both supervised NMT and unsupervised NMT: we achieve a 35.52 BLEU score on IWSLT 2014 German to English translation, 28.98/29.89 BLEU scores on WMT 2014 English to German translation without/with monolingual data, and a 22.05 BLEU score on WMT 2016 unsupervised German to English translation.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tied Transformers: Neural Machine Translation with Shared Encoder and Decoder

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jul 17, 2019
Citations: 53

Similar Papers

Attention Focusing for Neural Machine Translation by Bridging Source and Target Embeddings
Shaohui Kuang ... Weihua Luo
-
Shaohui Kuang, et. al.Shaohui Kuang ... Weihua Luo
01 Jan 2018
01 Jan 2018

Bidirectional Boost: On Improving Tibetan-Chinese Neural Machine Translation With Back-Translation and Self-Learning
Sangjie Duanzhu ... Cairang Jia
-
Sangjie Duanzhu, et. al.Sangjie Duanzhu ... Cairang Jia
24 Dec 2020
24 Dec 2020

Development of Neural Machine Translator for English-Assamese Language Pair
Basab Nath ... Surajeet Das
-
Basab Nath, et. al.Basab Nath ... Surajeet Das
03 Aug 2021
03 Aug 2021

Improving Neural Machine Translation Models with Monolingual Data
Rico Sennrich ... Alexandra Birch
-
Rico Sennrich, et. al.Rico Sennrich ... Alexandra Birch
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tied Transformers: Neural Machine Translation with Shared Encoder and Decoder

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence