Abstract

Machine translation (MT) draws from several different disciplines, making it a complex subject to teach. There are excellent pedagogical texts, but problems in MT and current algorithms for solving them are best learned by doing. As a centerpiece of our MT course, we devised a series of open-ended challenges for students in which the goal was to improve performance on carefully constrained instances of four key MT tasks: alignment, decoding, evaluation, and reranking. Students brought a diverse set of techniques to the problems, including some novel solutions which performed remarkably well. A surprising and exciting outcome was that student solutions or their combinations fared competitively on some tasks, demonstrating that even newcomers to the field can help improve the state-of-the-art on hard NLP problems while simultaneously learning a great deal. The problems, baseline code, and results are freely available.

Highlights

  • A decade ago, students interested in natural language processing arrived at universities having been exposed to the idea of machine translation (MT) primarily through science fiction.

  • We provided three simple Python programs: evaluate implements a simple ranking of the systems based on position-independent word error rate (PER; Tillmann et al., 1997), which computes a bag-of-words overlap between the system translations and the reference.

  • The best submission, obtaining a correlation of 83.5, relied on the idea that the reference and machine translation should be good paraphrases of each other (Owczarzak et al., 2006; Kauchak and Barzilay, 2006). It employed a simple paraphrase system trained on the alignment challenge data, using the pivot technique of Bannard and Callison-Burch (2005), and computing the optimal alignment between machine translation and reference under a simple model in which words could align if they were paraphrases.
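The bag-of-words overlap behind PER can be sketched in a few lines. This is one common formulation of position-independent word error rate, not necessarily the exact scoring used by the course's evaluate program: word order is ignored, multisets of words are intersected, and the unmatched remainder is normalized by reference length.

```python
from collections import Counter

def per(hypothesis, reference):
    """Position-independent word error rate (one common formulation).

    Compares the hypothesis and reference as bags of words, so word
    order does not matter; lower is better, 0.0 means identical bags.
    """
    hyp = Counter(hypothesis.split())
    ref = Counter(reference.split())
    # Multiset intersection: how many hypothesis tokens find a match
    # among the reference tokens, regardless of position.
    matches = sum((hyp & ref).values())
    # Tokens in the longer side that fail to match count as errors,
    # normalized by the reference length.
    errors = max(sum(hyp.values()), sum(ref.values())) - matches
    return errors / sum(ref.values())
```

Because PER ignores position, a reordered but otherwise correct translation scores 0.0, which is exactly why it gives only a rough ranking of systems.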
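The paraphrase-based metric in the last highlight can be illustrated with a small sketch. The names below (paraphrase_match, the paraphrase-pair set) are hypothetical, and the greedy one-to-one matching here is a simplification of the optimal alignment the actual submission computed; the point is only the core model, in which a hypothesis word may align to a reference word if the two are identical or known paraphrases.

```python
def paraphrase_match(hyp_words, ref_words, paraphrases):
    """Fraction of reference words aligned to the hypothesis, where a
    word pair may align if identical or listed as paraphrases.

    `paraphrases` is a set of (word, word) pairs, e.g. extracted via
    the pivot technique from bilingual alignments.  Greedy one-to-one
    matching; an optimal bipartite matching could score slightly higher.
    """
    available = list(ref_words)
    matched = 0
    for h in hyp_words:
        for r in available:
            if h == r or (h, r) in paraphrases or (r, h) in paraphrases:
                available.remove(r)  # each reference word aligns at most once
                matched += 1
                break
    return matched / max(len(ref_words), 1)
```

Under this model a hypothesis that substitutes "automobile" for "car" is not penalized, provided the pair appears in the paraphrase table, which is what lets the metric reward meaning-preserving variation that PER would count as an error.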


Summary

Introduction

A decade ago, students interested in natural language processing arrived at universities having been exposed to the idea of machine translation (MT) primarily through science fiction. Today, incoming students have been exposed to services like Google Translate since they were in secondary school or earlier. It makes sense to teach statistical MT, either on its own or as a unit in a class on natural language processing (NLP), machine learning (ML), or artificial intelligence (AI). A course that promises to show students how Google Translate works and teach them how to build something like it is especially appealing, and several universities and summer schools offer such classes. There are excellent introductory texts—depending on the level of detail required, instructors can choose from a comprehensive MT textbook (Koehn, 2010), a chapter of a popular NLP textbook (Jurafsky and Martin, 2009), a tutorial survey (Lopez, 2008), or an intuitive tutorial on the IBM Models (Knight, 1999b), among many others.

