Towards Better Word Alignment in Transformer

Kai Song,Yue Zhang,Weihua Luo,Xiaoqing Zhou,Zhongqiang Huang,Xiangyu Duan,Heng Yu,Min Zhang

doi:10.1109/taslp.2020.2998278

Abstract

While neural models based on the Transformer architecture achieve the State-of-the-Art translation performance, it is well known that the learned target-to-source attentions do not correlate well with word alignment. There is an increasing interest in inducing accurate word alignment in Transformer, due to its important role in practical applications such as dictionary-guided translation and interactive translation. In this article, we extend and improve the recent work on unsupervised learning of word alignment in Transformer on two dimensions: a) parameter initialization from a pre-trained cross-lingual language model to leverage large amounts of monolingual data for learning robust contextualized word representations, and b) regularization of the training objective to directly model characteristics of word alignments which results in favorable word alignments receiving more concentrated probabilities. Experiments on benchmark data sets of three language pairs show that the proposed methods can significantly reduce alignment error rate (AER) by at least 3.7 to 7.7 points on each language pair over two recent works on improving the Transformer's word alignment. Moreover, our methods can achieve better alignment results than GIZA++ on certain test sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Towards Better Word Alignment in Transformer

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2020
Citations: 42

Similar Papers

POS-based word alignment for small corpus
Jyoti Srivastava ... Sudip Sanyal
-
Jyoti Srivastava, et. al.Jyoti Srivastava ... Sudip Sanyal
01 Oct 2015
01 Oct 2015

Annotated Guidelines and Building Reference Corpus for Myanmar-English Word Alignment
Nway Nway Han ... Aye Thida
International Journal on Natural Language Computing | VOL. 8
Nway Nway Han, et. al.Nway Nway Han ... Aye Thida
31 Aug 2019
International Journal on Natural Language Computing | VOL. 8

Integration Algorithm of English-Chinese Word Segmentation and Alignment
Zhi-Ming Xu ... Chun-Yu Kit
-
Zhi-Ming Xu, et. al.Zhi-Ming Xu ... Chun-Yu Kit
01 Jan 2006
01 Jan 2006

Segmenting Long Sentence Pairs to Improve Word Alignment in English-Hindi Parallel Corpora
Jyoti Srivastava ... Sudip Sanyal
-
Jyoti Srivastava, et. al.Jyoti Srivastava ... Sudip Sanyal
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Towards Better Word Alignment in Transformer

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing