Optimization for Statistical Machine Translation: A Survey

Graham Neubig, Taro Watanabe

doi:10.1162/coli_a_00241

Open Access

https://doi.org/10.1162/coli_a_00241

Abstract

In statistical machine translation (SMT), the optimization of the system parameters to maximize translation accuracy is now a fundamental part of virtually all modern systems. In this article, we survey 12 years of research on optimization for SMT, from the seminal work on discriminative models (Och and Ney 2002) and minimum error rate training (Och 2003), to the most recent advances. Starting with a brief introduction to the fundamentals of SMT systems, we follow by covering a wide variety of optimization algorithms for use in both batch and online optimization. Specifically, we discuss losses based on direct error minimization, maximum likelihood, maximum margin, risk minimization, ranking, and more, along with the appropriate methods for minimizing these losses. We also cover recent topics, including large-scale optimization, nonlinear models, domain-dependent optimization, and the effect of MT evaluation measures or search on optimization. Finally, we discuss the current state of affairs in MT optimization, and point out some unresolved problems that will likely be the target of further research in optimization for MT.

Journal: Computational Linguistics
Publication Date: Mar 1, 2016
Citations: 60
License type: cc-by-nc-nd

Similar Papers

Softmax-margin training for statistical machine translation
Wenwen Zhang ... Hailong Cao
01 May 2012

Random restarts in minimum error rate training for statistical machine translation
Robert C Moore ... Chris Quirk
01 Jan 2008

Adaptation in Statistical Machine Translation for Low-resource Domains in English-Vietnamese Language
Nghia-Luan Pham ... Van-Vinh Nguyen
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36
30 May 2020

Improving the Performance of Low-Resource SMT Using Neural-Inspired Sentence Generator
Nirmal Kumar ... K Mrinalini
01 Feb 2018
