Multilingual Neural Machine Translation Research Articles

Translation from one language to another without human intervention is known as machine translation (MT). Multilingual neural machine translation (MNMT) is an MT technique that builds a single model for multiple languages. It is preferred over other approaches because it decreases training time and improves translation in low-resource contexts, i.e., for languages with insufficient corpora. However, good-quality MT models are yet to be built for many scenarios, such as Indic-to-Indic languages (IL-IL). Hence, this article attempts to develop baseline models for low-resource language pairs, i.e., IL-IL (for 11 Indic languages (ILs)), in a multilingual environment. The models are built on the Samanantar corpus and analyzed on the Flores-200 corpus. All the models are evaluated using a standard evaluation metric, the Bilingual Evaluation Understudy (BLEU) score (on a scale of 0 to 100). This article examines the effect on the MNMT model of grouping related languages, namely East Indo-Aryan (EI), Dravidian (DR), and West Indo-Aryan (WI). The experiments reveal that related-language grouping is beneficial for the WI group only, is detrimental for the EI group, and shows an inconclusive effect on the DR group. The role of pivot-based MNMT models in enhancing translation quality is also investigated. Owing to the availability of large, good-quality corpora from English (EN) to ILs, MNMT IL-IL models using EN as a pivot are built and examined. To achieve this, English-Indic language (EN-IL) models are developed with and without the use of related languages. Results show that related-language grouping is advantageous specifically for EN to ILs. Thus, related-language groups are used for the development of the pivot MNMT models. It is also observed that the use of pivot models greatly improves on the MNMT baselines.
Furthermore, the effect of transliteration on ILs is analyzed. To explore transliteration, the best MNMT models from the previous approaches (in most cases, the pivot model using related groups) are identified and rebuilt on a corpus transliterated from the corresponding scripts to a modified Indian language Transliteration script (ITRANS). The experiments indicate that transliteration helps the models built for lexically rich languages, with the largest BLEU gains observed for Malayalam (ML) and Tamil (TA), i.e., 6.74 and 4.72, respectively. The BLEU scores of the transliteration models range from 7.03 to 24.29. The best model obtained is for the Punjabi (PA)-Hindi (HI) language pair, trained on the PA-WI transliterated corpus.
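
The pivot approach described in this abstract can be pictured as chaining two translators through English. The sketch below is an illustration only and not the authors' implementation: the word-lookup "models", the names make_toy_translator and pivot_translate, and the romanized Hindi and Tamil entries are all assumptions introduced for this example. A real system would put trained MNMT models in place of the lookup tables.

```python
from typing import Callable, Dict

# A translator is any function that maps a sentence to a sentence.
Translator = Callable[[str], str]


def make_toy_translator(table: Dict[str, str]) -> Translator:
    """Return a word-by-word lookup translator.

    Hypothetical stand-in for a trained NMT model; unknown words are
    passed through unchanged.
    """
    def translate(sentence: str) -> str:
        return " ".join(table.get(word, word) for word in sentence.split())
    return translate


def pivot_translate(sentence: str,
                    src_to_pivot: Translator,
                    pivot_to_tgt: Translator) -> str:
    """Translate source -> pivot (here English) -> target in two hops."""
    pivot_sentence = src_to_pivot(sentence)
    return pivot_to_tgt(pivot_sentence)


# Toy romanized vocabularies (assumed for illustration only).
hi_to_en = make_toy_translator({"paani": "water", "doodh": "milk"})
en_to_ta = make_toy_translator({"water": "thanneer", "milk": "paal"})

if __name__ == "__main__":
    print(pivot_translate("paani doodh", hi_to_en, en_to_ta))
```

Because the two hops are independent, errors made in the first hop are carried into the second, so the quality of the English-side models matters for the final IL-IL output.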

Abstract: Natural Language Processing (NLP), in the form recognizable today, began to take hold in the 1980s, when machine learning helped propel it forward. However, due to a lack of processing power, innovation in machine learning and, to an extent, NLP slowed and had almost ground to a halt until the last decade, when a sudden increase in both productivity and interest in machine learning expanded the body of knowledge in the field. This review presents several case studies that use different methodologies. The first paper is a deep analysis of how researchers used Tesseract and Google Vision in tandem with automatic data mining methods to enrich the Cherokee language database in order to preserve the language from extinction. The second paper takes a query translation-based approach to translating English to Indian languages and uses a Multilingual Cross-Language Information Retrieval (MLCIR) system with tools such as a Part of Speech Tagger (POST), stop-word removal, and the Porter Stemmer. The third paper presents CoVe, which transfers knowledge from machine translation to improve performance on NLP tasks such as sentiment analysis and question answering by using contextualized word vectors along with word embeddings, achieving new state-of-the-art results on some datasets. The fourth paper aims to translate English to Pakistan Sign Language (PSL); it also uses POST and goes through dependency analysis, sentence classification, and PSL using PLS trees. The fifth paper uses a multilingual Neural Machine Translation (NMT) system for low-resource languages and incorporates two main models: a recurrent NMT and a transformer NMT. The sixth paper analyzes how a fine-tuned transformer model seems to work better than transformer models trained from scratch on high-resource languages, while the reverse seems to hold for low-resource languages.
The seventh paper adds to this by discussing how multilingual translation seems to work better than a back-translation model. Given the diverse array of approaches available, we aim to identify the most efficient and correct methodology for future researchers to use in their work, based on the papers in this literature review.

Articles published on Multilingual Neural Machine Translation

On the shortcut learning in multilingual neural machine translation

Multilingual Neural Machine Translation for Indic to Indic Languages

Advancements in Neural Machine Translation: Methodological Innovations and Empirical Insights for Cross-Linguistic Discourse Preservation

Unsupervised multilingual machine translation with pretrained cross-lingual encoders

Language relatedness evaluation for multilingual neural machine translation

Towards better Chinese-centric neural machine translation for low-resource languages

Low-resource Multilingual Neural Translation Using Linguistic Feature-based Relevance Mechanisms

Improving Multilingual Neural Machine Translation System for Indic Languages

Improving Many-to-Many Neural Machine Translation via Selective and Aligned Online Data Augmentation

Rectifying Ill-Formed Interlingual Space: A Framework for Zero-Shot Translation on Modularized Multilingual NMT

Parameter Differentiation Based Multilingual Neural Machine Translation

Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters

Synchronous Inference for Multilingual Neural Machine Translation

Open and Competitive Multilingual Neural Machine Translation in Production

A Survey of Orthographic Information in Machine Translation

Synchronous Interactive Decoding for Multilingual Neural Machine Translation

A Survey of Multilingual Neural Machine Translation

Evaluating the Cross-Lingual Effectiveness of Massively Multilingual Neural Machine Translation

Cross-Lingual Pre-Training Based Transfer for Zero-Shot Neural Machine Translation

On the Linguistic Representational Power of Neural Machine Translation Models
