Hindi to English: Transformer-Based Neural Machine Translation

Kavit Gangar,Hardik Ruparel,Shreyas Lele

doi:10.1007/978-981-33-4909-4_25

Abstract

Machine Translation (MT) is one of the most prominent tasks in Natural Language Processing (NLP) which involves the automatic conversion of texts from one natural language to another while preserving its meaning and fluency. Although the research in machine translation has been going on since multiple decades, the newer approach of integrating deep learning techniques in natural language processing has led to significant improvements in the translation quality. This paper has developed a Neural Machine Translation (NMT) system by training the Transformer model to translate texts from Indian Language Hindi to English. Hindi being a low resource language has made it difficult for neural networks to understand the language thereby leading to a slow growth in the development of neural machine translators. Thus, to address this gap, back-translation is implemented to augment the training data and for creating the vocabulary, it has been experimented with both word and subword level tokenization using Byte Pair Encoding (BPE) thereby ending up training the Transformer in 10 different configurations. This led us to achieve a state-of-the-art BLEU score of 24.53 on the test set of IIT Bombay English-Hindi Corpus in one of the configurations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hindi to English: Transformer-Based Neural Machine Translation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Natural Language Processing and Computational Linguistics
Junichi Tsujii
Computational Linguistics | VOL. -
Junichi TsujiiJunichi Tsujii
07 Dec 2021
Computational Linguistics | VOL. -

Improving the Performance of Vietnamese–Korean Neural Machine Translation with Contextual Embedding
Van-Hai Vu ... Cheol-Young Ock
Applied Sciences | VOL. 11
Van-Hai Vu, et. al.Van-Hai Vu ... Cheol-Young Ock
23 Nov 2021
Applied Sciences | VOL. 11

Machine translation of standardised medical terminology using natural language processing: A scoping review
Richard Noll ... Jannik Schaaf
New Biotechnology | VOL. 77
Richard Noll, et. al.Richard Noll ... Jannik Schaaf
29 Aug 2023
New Biotechnology | VOL. 77

Framework for Handling Rare Word Problems in Neural Machine Translation System Using Multi-Word Expressions
Kamal Deep Garg ... Bhisham Sharma
Applied Sciences | VOL. 12
Kamal Deep Garg, et. al.Kamal Deep Garg ... Bhisham Sharma
31 Oct 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hindi to English: Transformer-Based Neural Machine Translation

Abstract

Talk to us

Similar Papers