Improving Word Alignment by Adding Gromov-Wasserstein into Attention Neural Network

Yan Huang,Tianyuan Zhang,Huidong Zhu

doi:10.1088/1742-6596/2171/1/012043

Yan Huang, Tianyuan Zhang + Show 1 more

Open Access

https://doi.org/10.1088/1742-6596/2171/1/012043

Copy DOI

Abstract

Statistical machine translation systems usually break the translation task into two or more subtasks and an important one is finding word alignments over a parallel sentence bilingual corpus. We address the problem of introducing word alignment for language pairs by developing a novel neural network model that can applied to other generative alignment models. We use Multi-layer attention model and multi-layer model with multi-head-attention mechanism on each layer provides superior translation quality. It can be trained on bilingual data without relying on word alignment. In this paper, we cast the correspondence problem directly as an optimal distance problem. We use the Gromov-Wasserstein distance to calculated how similarities between word pairs are related across languages. The resulting alignments dramatically outperform the GIZA++ and FastAlign approach, these alignments are comparable on public data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Physics: Conference Series	Publication Date: Jan 1, 2022
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Improving Word Alignment by Adding Gromov-Wasserstein into Attention Neural Network

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Similar Papers

What types of word alignment improve statistical machine translation?
Patrik Lambert ... Yanjun Ma
Machine Translation | VOL. 26
Patrik Lambert, et. al.Patrik Lambert ... Yanjun Ma
10 Mar 2012
Machine Translation | VOL. 26

Using Statistical Machine Translation to Grade Training Data
Andrew Finch ... Eiichiro Sumita
-
Andrew Finch, et. al.Andrew Finch ... Eiichiro Sumita
01 Dec 2008
01 Dec 2008

Benefits of Morphosyntactic Features on English-Arabic Statistical Machine Translation
Safae Berrichi ... Azzeddine Mazroui
-
Safae Berrichi, et. al.Safae Berrichi ... Azzeddine Mazroui
01 Oct 2018
01 Oct 2018

Discriminative Word Alignment by Linear Modeling
Yang Liu ... Qun Liu
Computational Linguistics | VOL. 36
Yang Liu, et. al.Yang Liu ... Qun Liu
01 Sep 2010
Computational Linguistics | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Word Alignment by Adding Gromov-Wasserstein into Attention Neural Network

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series