Human or Neural Translation?

Shivendra Bhardwaj,Gabriel Bernier-Colborne,David Alfonso Hermelo,Cyril Goutte,Michel Simard,Phillippe Langlais

doi:10.18653/v1/2020.coling-main.576

Abstract

Deep neural models tremendously improved machine translation. In this context, we investigate whether distinguishing machine from human translations is still feasible. We trained and applied 18 classifiers under two settings: a monolingual task, in which the classifier only looks at the translation; and a bilingual task, in which the source text is also taken into consideration. We report on extensive experiments involving 4 neural MT systems (Google Translate, DeepL, as well as two systems we trained) and varying the domain of texts. We show that the bilingual task is the easiest one and that transfer-based deep-learning classifiers perform best, with mean accuracies around 85% in-domain and 75% out-of-domain .

Highlights

This work addresses the task of distinguishing between translations produced by humans and machines
We compare feature-based approaches with several deep learning methods, investigating the impact of text domains and MT systems, paying attention to cases where the translation engine at test time is different from the one used for training, which we found often not studied in related work
The best transfer learning method we tested recorded an in-domain accuracy of 87.6% and out-of-domain performances ranging between 65.4% and 84.2% depending on the domain of texts and MT system considered

Summary

Introduction

This work addresses the task of distinguishing between translations produced by humans and machines. Practical applications for this include: improving machine translation systems (Li et al, 2015), filtering parallel data mined from the Web (Arase and Zhou, 2013) and evaluating machine translation quality without reference translations (Aharoni et al, 2014). We compare feature-based approaches with several deep learning methods, investigating the impact of text domains and MT systems (in-house neural engines, Google Translate, DeepL), paying attention to cases where the translation engine at test time is different from the one used for training, which we found often not studied in related work. We believe our study offers many new data points, and hope it will foster research on this timely topic

Objectives

Methods

Findings

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Human or Neural Translation?

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2020
Citations: 4	License type: cc-by

Similar Papers

Human versus Neural Machine Translation Creativity: A Study on Manipulated MWEs in Literature
Gloria Corpas Pastor ... Laura Noriega-Santiáñez
Information | VOL. 15
Gloria Corpas Pastor, et. al.Gloria Corpas Pastor ... Laura Noriega-Santiáñez
02 Sep 2024
Information | VOL. 15

Human and machine translation of occasionalisms in literary texts
Waltraud Kolb ... Wolfgang U Dressler
Target | VOL. 35
Waltraud Kolb, et. al.Waltraud Kolb ... Wolfgang U Dressler
03 Apr 2023
Target | VOL. 35

Neural machine translation and human translation
Anfeng Sheng ... Yankun Kong
Babel | VOL. -
Anfeng Sheng, et. al.Anfeng Sheng ... Yankun Kong
24 Jul 2023
Babel | VOL. -

Automated and Human Interaction in Written Discourse: A Contrastive Parallel Corpus-based Investigation of Metadiscourse Features in Machine-Human Translations
Muhammad Afzaal ... Muhammad Imran
SAGE Open | VOL. 12
Muhammad Afzaal, et. al.Muhammad Afzaal ... Muhammad Imran
01 Oct 2022
SAGE Open | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Human or Neural Translation?

Abstract

Highlights

Summary

Talk to us

Similar Papers