Improving English-Assamese Neural Machine Translation Using Transliteration-Based Approach

Sahinur Rahman Laskar,Partha Pakray,Sivaji Bandyopadhyay,Bishwaraj Paul

doi:10.1007/978-981-19-7513-4_20

Abstract

Natural language translation is a well-defined task of linguistic technology that minimizes communication gap among people of diverse linguistic backgrounds. Although neural machine translation attains remarkable translational performance, it requires adequate amount of train data, which is a challenging task for low-resource language pair translation. Also, neural machine translation handles rare word problems, i.e., low-frequency words translation at the subword level, but it shows weakness for highly inflected language translation. In this work, we have explored neural machine translation on low-resource English-Assamese language pair with a proposed transliteration approach in the data preprocessing step. In the transliteration approach, the source language is transliterated into target language script that leverages a smaller subword vocabulary for the source-target languages. Moreover, the pre-trained embeddings on the monolingual data of transliterated source and target languages are used in the training process. With our approach, the neural machine translation significantly improves translational performance for English-to-Assamese and Assamese-to-English translation and obtain state-of-the-art results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving English-Assamese Neural Machine Translation Using Transliteration-Based Approach

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Addressing domain shift in neural machine translation via reinforcement learning
Amit Kumar ... Sriparna Saha
Expert Systems with Applications | VOL. 201
Amit Kumar, et. al.Amit Kumar ... Sriparna Saha
09 Apr 2022
Expert Systems with Applications | VOL. 201

Neural Machine Translation for Low-Resource Languages from a Chinese-centric Perspective: A Survey
Jinyi Zhang ... Jiannan Mao
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
Jinyi Zhang, et. al.Jinyi Zhang ... Jiannan Mao
16 May 2024
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -

Baidu Translate: Research and Products
Zhongjun He
-
Zhongjun HeZhongjun He
01 Jan 2015
01 Jan 2015

Character-Aware Low-Resource Neural Machine Translation with Weight Sharing and Pre-training
Yichao Cao ... Miao Li
-
Yichao Cao, et. al.Yichao Cao ... Miao Li
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving English-Assamese Neural Machine Translation Using Transliteration-Based Approach

Abstract

Talk to us

Similar Papers