Hybrid Machine Translation with Multi-Source Encoder-Decoder Long Short-Term Memory in English-Malay Translation

Yin-Lai Yeong,Tien-Ping Tan,Siti Khaotijah Mohammad,Keng Hoon Gan

doi:10.18517/ijaseit.8.4-2.6816

Abstract

<p class='IJASEITAbtract'>Statistical Machine Translation (SMT) and Neural Machine Translation (NMT) are the state-of-the-art approaches in machine translation (MT). The translation produced by a SMT is based on the statistical analysis of text corpora, while NMT uses deep neural network to model and to generate a translation. SMT and NMT have their strength and weaknesses. SMT may produce better translation with a small parallel text corpus compared to NMT. Nevertheless, when the amount of parallel text available is large, the quality of the translation produced by NMT is often higher than SMT. Besides that, study also shown that the translation produced by SMT is better than NMT in cases where there is a domain mismatch between training and testing. SMT also has an advantage on long sentences. In addition, when a translation produced by an NMT is wrong, it is very difficult to find the error. In this paper, we investigate a hybrid approach that combine SMT and NMT to perform English to Malay translation. The motivation of using a hybrid machine translation is to combine the strength of both approaches to produce a more accurate translation. Our approach uses the multi-source encoder-decoder long short-term memory (LSTM) architecture. The architecture uses two encoders, one to embed the sentence to be translated, and another encoder to embed the initial translation produced by SMT. The translation from the SMT can be viewed as a “suggestion translation” to the neural MT. Our experiments show that the hybrid MT increases the BLEU scores of our best baseline machine translation in computer science domain and news domain from 21.21 and 48.35 to 35.97 and 61.81 respectively.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal on Advanced Science, Engineering and Information Technology	Publication Date: Sep 26, 2018
Citations: 3	License type: cc-by-sa

R Discovery Prime

R Discovery Prime

Hybrid Machine Translation with Multi-Source Encoder-Decoder Long Short-Term Memory in English-Malay Translation

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology

Lead the way for us

Similar Papers

Multilingual Neural Translation

-

14 Feb 2020
14 Feb 2020

Adaptation in Statistical Machine Translation for Low-resource Domains in English-Vietnamese Language
Nghia-Luan Pham ... Van-Vinh Nguyen
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36
Nghia-Luan Pham, et. al.Nghia-Luan Pham ... Van-Vinh Nguyen
30 May 2020
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36

A Pragmatic Analysis of Machine Translation Techniques for Preserving the Authenticity of the Sanskrit Language
Nandini Sethi ... Poonam Bansal
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
Nandini Sethi, et. al.Nandini Sethi ... Poonam Bansal
25 Jul 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -

A Comparison of Transformer, Recurrent Neural Networks and SMT in Tamil to Sinhala MT
Ruvan Weerasinghe ... Randil Pushpananda
-
Ruvan Weerasinghe, et. al.Ruvan Weerasinghe ... Randil Pushpananda
04 Nov 2020
04 Nov 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid Machine Translation with Multi-Source Encoder-Decoder Long Short-Term Memory in English-Malay Translation

Abstract

Talk to us

Similar Papers

More From: International Journal on Advanced Science, Engineering and Information Technology