Measuring Machine Translation Errors in New Domains

Ann Irvine,Hal Daumé,Marine Carpuat,John Morgan,Dragos Munteanu

doi:10.1162/tacl_a_00239

Abstract

We develop two techniques for analyzing the effect of porting a machine translation system to a new domain. One is a macro-level analysis that measures how domain shift affects corpus-level evaluation; the second is a micro-level analysis for word-level errors. We apply these methods to understand what happens when a Parliament-trained phrase-based machine translation system is applied in four very different domains: news, medical texts, scientific articles and movie subtitles. We present quantitative and qualitative experiments that highlight opportunities for future research in domain adaptation for machine translation.

Highlights

When building a statistical machine translation (SMT) system, the expected use case is often limited to a specific domain, genre and register ( “domain” refers to this set, in keeping with standard, imprecise, terminology), such as a particular type of legal or medical document
One important feature of our methodologies is that we focus on errors that could possibly be fixed given access to data from a new domain, rather than all errors that might arise because the particular translation model used is inadequate to capture the required
Adapting an SMT system from the Parliament domain to the news domain is not a representative adaptation task; there are a very small number of errors due to unseen words, which are minor in comparison to all other domains. (Despite the fact that most previous work focuses exclusively on using news as a “new” domain, §3). 2

Summary

Introduction

When building a statistical machine translation (SMT) system, the expected use case is often limited to a specific domain, genre and register ( “domain” refers to this set, in keeping with standard, imprecise, terminology), such as a particular type of legal or medical document. It is expensive to obtain enough parallel data to reliably estimate translation models in a new domain. One can hope that large amounts of data from another, “old domain,” might be close enough to stand as a proxy. This is the defacto standard: we train SMT systems on Parliament proceedings, but use them to translate all sorts of new text. This results in significantly degraded translation quality. We show quantitative (§7.1) and qualitative (§7.2) results obtained from our methods on

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Dec 1, 2013
Citations: 79	License type: cc-by

R Discovery Prime

R Discovery Prime

Measuring Machine Translation Errors in New Domains

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

QCRI's Live Speech Translation System
Fahim Dalvi ... Stephan Vogel
-
Fahim Dalvi, et. al.Fahim Dalvi ... Stephan Vogel
01 Jan 2018
01 Jan 2018

Combining Machine Translated Sentence Chunks from Multiple MT Systems
Matīss Rikters ... Inguna Skadiņa
-
Matīss Rikters, et. al.Matīss Rikters ... Inguna Skadiņa
01 Jan 2018
01 Jan 2018

Quantitative fine-grained human evaluation of machine translation systems: a case study on English to Croatian
Filip Klubička ... Víctor M Sánchez-Cartagena
Machine Translation | VOL. 32
Filip Klubička, et. al.Filip Klubička ... Víctor M Sánchez-Cartagena
10 Feb 2018
Machine Translation | VOL. 32

A novel and robust approach for pro-drop language translation
Longyue Wang ... Xiaojun Zhang
Machine Translation | VOL. 31
Longyue Wang, et. al.Longyue Wang ... Xiaojun Zhang
13 Jan 2017
Machine Translation | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Measuring Machine Translation Errors in New Domains

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics