Ranking vs. regression in machine translation evaluation

Kevin Duh

doi:10.3115/1626394.1626425

Abstract

Automatic evaluation of machine translation (MT) systems is an important research topic for the advancement of MT technology. Most automatic evaluation methods proposed to date are score-based: they compute scores that represent translation quality, and MT systems are compared on the basis of these scores. We advocate an alternative perspective of automatic MT evaluation based on ranking. Instead of producing scores, we directly produce a ranking over the set of MT systems to be compared. This perspective is often simpler when the evaluation goal is system comparison. We argue that it is easier to elicit human judgments of ranking and develop a machine learning approach to train on rank data. We compare this ranking method to a score-based regression method on WMT07 data. Results indicate that ranking achieves higher correlation to human judgments, especially in cases where ranking-specific features are used.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ranking vs. regression in machine translation evaluation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Development of machine translation technology for assisting health communication: A systematic review.
Kristin N Dew ... Katrin Kirchhoff
Journal of Biomedical Informatics | VOL. 85
Kristin N Dew, et. al.Kristin N Dew ... Katrin Kirchhoff
19 Jul 2018
Journal of Biomedical Informatics | VOL. 85

Machine Translation Evaluation and Optimization
Bonnie Dorr ... John Mccary
-
Bonnie Dorr, et. al.Bonnie Dorr ... John Mccary
01 Jan 2010
01 Jan 2010

Using Multiple Edit Distances to Automatically Grade Outputs From Machine Translation Systems
Y Akiba ... E Sumita
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14
Y Akiba, et. al.Y Akiba ... E Sumita
01 Mar 2006
IEEE Transactions on Audio, Speech and Language Processing | VOL. 14

A Naïve Automatic MT Evaluation Method without Reference Translations
Junjie Jiang ... Youfang Lin
-
Junjie Jiang, et. al.Junjie Jiang ... Youfang Lin
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ranking vs. regression in machine translation evaluation

Abstract

Talk to us

Similar Papers