Exploring Graph-based Transformer Encoder for Low-Resource Neural Machine Translation

Long H B Nguyen,Binh Nguyen,Binh Le,Dien Dinh

doi:10.1145/3599969

Abstract

The Transformer is commonly used in Neural Machine Translation (NMT), but it faces issues with over-parameterization in low-resource settings. This means that simply increasing the model parameters significantly will not lead to improved performance. In this study, we propose a graph-based approach that slightly increases the parameters while significantly outperforming the scaled version of the Transformer. We accomplish this by utilizing Graph Neural Networks to encode Universal Conceptual Cognitive Annotation (UCCA), allowing the linguistic features of UCCA to be incorporated into the word embeddings. This improves the performance of the NMT system since the word embedding is now more capable and informative. Experimental results demonstrate that the proposed method outperforms the scaled Transformer model by +0.4, +0.41, and +0.33 BLEU, respectively, in English-Vietnamese/French/Czech datasets. Furthermore, this method reduces the number of parameters by 47% when compared to the scaled Transformer. A thorough analysis of error patterns reveals that the proposed method provides structural awareness to translation systems. Our code is available at: https://github.com/nqbinh17/UCCA_GNN.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring Graph-based Transformer Encoder for Low-Resource Neural Machine Translation

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: May 25, 2023
Citations: 1

Similar Papers

Multilingual Neural Translation

-

14 Feb 2020
14 Feb 2020

Handling Unknown Words in Neural Machine Translation System
Kamal Deep Garg ... Vandana Saini
-
Kamal Deep Garg, et. al.Kamal Deep Garg ... Vandana Saini
08 Nov 2020
08 Nov 2020

Towards better Chinese-centric neural machine translation for low-resource languages
Bin Li ... Hanjun Deng
Computer Speech & Language | VOL. 84
Bin Li, et. al.Bin Li ... Hanjun Deng
11 Sep 2023
Computer Speech & Language | VOL. 84

Iterative Training of Unsupervised Neural and Statistical Machine Translation Systems
Benjamin Marie ... Atsushi Fujita
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19
Benjamin Marie, et. al.Benjamin Marie ... Atsushi Fujita
01 Jun 2020
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring Graph-based Transformer Encoder for Low-Resource Neural Machine Translation

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing