In recent years, deep learning has driven significant advances in natural language processing (NLP), powering progress in both language understanding and generation. Deep learning uses multi-layer neural networks to automatically extract and represent complex features from data. This paper first introduces the basic principles of deep learning and the mainstream frameworks (such as TensorFlow and PyTorch), then discusses core NLP tasks, including word embedding, language modeling, and text generation. It next analyzes how deep learning is applied in practice to text classification, machine translation, automatic summarization, dialogue systems, and speech recognition. Further discussion covers model optimization methods, including architectural choices (such as RNNs, LSTMs, GRUs, and the Transformer), data preprocessing and feature engineering, hyperparameter tuning, and accelerated computation. Finally, it examines cutting-edge optimization strategies such as federated learning, model compression, self-supervised learning, and transfer learning, and proposes future research directions and challenges. The aim of this paper is a systematic analysis of the applications and optimization strategies of deep learning in NLP.