Abstract

Problems in machine translation are often tied to the characteristics of a language family, especially syntactic divergences between languages. In a translation task, having the source and target languages in the same family is a luxury that cannot be relied upon, so trained models must overcome such differences either through manual augmentation or through automatically inferred capacity built into the model design. In this work, we investigate the impact of several methods of handling differing word orders during translation, and further experiment with assimilating the source language's syntax to the target word order using pre-ordering. We focus on extremely low-resource scenarios. We also conduct experiments on practical data augmentation techniques that support the reordering capacity of the models by varying the training objective, adding the secondary goal of removing noise from, or reordering, corrupted input sequences. In particular, we propose methods to improve translation quality with a denoising autoencoder in Neural Machine Translation (NMT) and a pre-ordering method in Phrase-based Statistical Machine Translation (PBSMT). Experiments on a number of English-Vietnamese pairs show improvements in BLEU scores over both the baseline NMT and SMT systems.
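As an illustration of the denoising objective described above, the sketch below corrupts a token sequence with word dropout and a local shuffle, a common noise function for denoising autoencoders in NMT; the function name `add_noise` and the parameters `p_drop` and `k` are illustrative assumptions, not the paper's exact configuration.

```python
import random

def add_noise(tokens, p_drop=0.1, k=3, rng=random):
    """Corrupt a token sequence for a denoising objective:
    drop each token with probability p_drop, then apply a
    local shuffle that moves no token more than k positions."""
    # Word dropout: remove tokens independently, keeping at least one.
    kept = [t for t in tokens if rng.random() > p_drop] or tokens[:1]
    # Local shuffle: sort by (index + uniform noise in [0, k)), which
    # guarantees each token is displaced by at most k positions.
    keys = [i + rng.uniform(0, k) for i in range(len(kept))]
    return [t for _, t in sorted(zip(keys, kept), key=lambda p: p[0])]

src = "the model must learn to restore the original order".split()
print(add_noise(src))
```

Training the translation model to reconstruct the original sequence from such corrupted inputs adds the secondary reordering objective mentioned in the abstract.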
