Abstract The mainstream machine translation model, the Transformer, relies entirely on the self-attention mechanism, but it still cannot exploit the syntactic structure of natural language during translation, which leads to problems such as mistranslation and omission. Traditional RNN- and attention-based machine translation models also compute positional encodings with a fixed formula, so the encodings carry no contextual information. To address this, this paper obtains source-language sequences containing contextual positional information by introducing a bidirectional long short-term memory network (Bi-LSTM) and a tree-structured long short-term memory network (Tree-LSTM), trained horizontally and vertically, respectively; the self-attention mechanism is applied within the Tree-LSTM so that the relative positional information between words is preserved to the greatest extent. On this basis, a Bi-Tree-LSTM translation model based on positional-encoding optimization is constructed. The model's performance is tested on four datasets (general, legal, business, and film-and-television texts), its BLEU scores are analyzed under low data resources and increasing sentence length, and a 4,000-sentence English long text is translated to identify erroneous sentences and assess translation quality. On the four text types, the proposed model achieves BLEU scores of 33.5, 35.2, 31.7, and 34.4, the highest among the compared models. With only 5K sentence pairs, its BLEU score reaches 26.14, which is 2.72 points higher than the best score achieved by the other machine translation models even at 50K pairs. For sentences of 8-18 words, its BLEU scores consistently remain above 45, and its peak performance exceeds that of the other models. In the 4,000-sentence long-text translation, 54 sentences contain errors, accounting for 1.39% of the whole text, compared with 7.15% for the Transformer model, in line with the expectations of the optimization design. This paper provides a new idea and a useful exploration for improving the accuracy of English machine translation.
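To make the tree-structured component of the architecture concrete, the following is a minimal sketch of a Child-Sum Tree-LSTM cell, the standard formulation that composes an arbitrary number of child states bottom-up over a syntax tree. This is an illustration of the general technique only, not the authors' implementation: the weight initialization, dimensions, and the `ChildSumTreeLSTMCell` class name are assumptions, and the self-attention weighting described in the abstract is omitted.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class ChildSumTreeLSTMCell:
    """Child-Sum Tree-LSTM cell: composes any number of child states so that
    syntactic tree structure informs each node's representation.
    Weights are random placeholders for illustration (not trained)."""
    def __init__(self, in_dim, mem_dim, rng):
        # Four gates: input (i), output (o), candidate (u), forget (f)
        self.W = rng.standard_normal((4, mem_dim, in_dim)) * 0.1   # input-to-gate
        self.U = rng.standard_normal((4, mem_dim, mem_dim)) * 0.1  # hidden-to-gate
        self.b = np.zeros((4, mem_dim))

    def __call__(self, x, child_h, child_c):
        # x: (in_dim,); child_h, child_c: (num_children, mem_dim).
        # A leaf node passes a single all-zero child state.
        h_sum = child_h.sum(axis=0)
        i = sigmoid(self.W[0] @ x + self.U[0] @ h_sum + self.b[0])
        o = sigmoid(self.W[1] @ x + self.U[1] @ h_sum + self.b[1])
        u = np.tanh(self.W[2] @ x + self.U[2] @ h_sum + self.b[2])
        # One forget gate per child preserves per-branch information
        f = sigmoid(self.W[3] @ x + child_h @ self.U[3].T + self.b[3])
        c = i * u + (f * child_c).sum(axis=0)
        h = o * np.tanh(c)
        return h, c

rng = np.random.default_rng(0)
cell = ChildSumTreeLSTMCell(in_dim=8, mem_dim=16, rng=rng)

# Two leaf words, then a parent node combining them along the tree
zero_h, zero_c = np.zeros((1, 16)), np.zeros((1, 16))
h1, c1 = cell(rng.standard_normal(8), zero_h, zero_c)
h2, c2 = cell(rng.standard_normal(8), zero_h, zero_c)
hp, cp = cell(rng.standard_normal(8), np.stack([h1, h2]), np.stack([c1, c2]))
print(hp.shape)  # (16,)
```

Because each node sums over its children rather than reading a fixed-length sequence, the same cell applies at every depth of the parse tree; the abstract's "vertical" training direction refers to this bottom-up composition, in contrast to the Bi-LSTM's "horizontal" left-to-right and right-to-left passes over the word sequence.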