A Quantitative Analysis of Discourse Phenomena in Machine Translation

Carolina Scarton, Lucia Specia

doi:10.4000/discours.9047

Abstract

State-of-the-art Machine Translation (MT) systems translate documents by considering isolated sentences, disregarding information beyond sentence level. As a result, machine-translated documents often contain problems related to discourse coherence and cohesion. Recently, some initiatives in the evaluation and quality estimation of MT outputs have attempted to detect discourse problems in order to assess the quality of these machine translations. However, a quantitative analysis of discourse phenomena in MT outputs is still needed in order to better understand the phenomena and identify possible solutions or ways to improve evaluation. This paper aims to answer the following questions: What is the impact of discourse phenomena on MT quality? Can we capture and measure quantitatively any issues related to discourse in MT outputs? In order to answer these questions, we present a quantitative analysis of several discourse phenomena and correlate the resulting figures with scores from automatic translation quality evaluation metrics. We show that figures related to discourse phenomena present a higher correlation with quality scores than the baseline counts widely used for quality estimation of MT.
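The correlation step mentioned in the abstract can be illustrated with a minimal sketch. It assumes Pearson's r as the coefficient, since the abstract does not name one. The function name `pearson`, the per-document connective counts, and the quality scores are all hypothetical placeholders, not data or code from the paper.

```python
from math import sqrt


def pearson(xs, ys):
    """Return Pearson's r between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-products of deviations and the two sums of squares.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical values, one entry per machine-translated document:
    # a count of one discourse feature and an automatic quality score.
    connective_counts = [12, 7, 15, 9, 4]
    quality_scores = [0.41, 0.30, 0.47, 0.35, 0.22]
    print(f"r = {pearson(connective_counts, quality_scores):.3f}")
```

A value of r close to 1 or -1 would indicate that the discourse feature tracks the quality score closely across documents, while a value near 0 would indicate little linear relationship.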

Journal: Discours
Publication Date: Sep 9, 2015
Citations: 12
License type: cc-by
