Automatic evaluation of machine translation quality using n-gram co-occurrence statistics

George Doddington

doi:10.3115/1289189.1289273

Abstract

Evaluation is recognized as an extremely helpful forcing function in Human Language Technology R&D. Unfortunately, evaluation has not been a very powerful tool in machine translation (MT) research because it requires human judgments and is thus expensive, time-consuming, and not easily factored into the MT research agenda. However, at the July 2001 TIDES PI meeting in Philadelphia, IBM described an automatic MT evaluation technique that can provide immediate feedback and guidance in MT research. Their idea, which they call an evaluation understudy, compares MT output with expert reference translations in terms of the statistics of short sequences of words (word N-grams). The more of these N-grams that a translation shares with the reference translations, the better the translation is judged to be. The idea is elegant in its simplicity. But far more important, IBM showed a strong correlation between these automatically generated scores and human judgments of translation quality. As a result, DARPA commissioned NIST to develop an MT evaluation facility based on the IBM work. This utility is now available from NIST and serves as the primary evaluation measure for TIDES MT research.
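
The N-gram comparison described in the abstract can be illustrated with a minimal sketch. It is not the IBM or NIST scoring formula: it only computes the fraction of a translation's word N-grams that also appear in the reference translations. The function names, the whitespace tokenization, and the choice to cap each N-gram's credit at its highest count in any single reference are all assumptions made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all word n-grams of length n in tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def shared_ngram_fraction(candidate, references, n):
    """Fraction of candidate n-grams that also occur in the references.

    Hypothetical helper for illustration only. Each candidate n-gram is
    credited at most as many times as it appears in the single reference
    where it is most frequent (an assumed clipping rule).
    """
    # Assumption: plain whitespace tokenization, no case folding.
    cand_counts = ngrams(candidate.split(), n)
    if not cand_counts:
        return 0.0

    # Highest count of each n-gram across the individual references.
    max_ref = Counter()
    for ref in references:
        for gram, count in ngrams(ref.split(), n).items():
            max_ref[gram] = max(max_ref[gram], count)

    matched = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return matched / sum(cand_counts.values())


if __name__ == "__main__":
    mt_output = "the cat sat on the mat"
    refs = ["the cat is on the mat"]
    for n in (1, 2):
        print(n, shared_ngram_fraction(mt_output, refs, n))
```

With the single reference above, five of the six candidate unigrams and three of the five candidate bigrams are shared, so the sketch yields 5/6 and 3/5 respectively. A real evaluation measure would combine several N-gram lengths and account for translation length, which this sketch deliberately leaves out.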

