Abstract
In statistical machine translation, standard tuning methods such as MERT learn a single weight vector on a given development set. These methods suffer from two problems caused by the diversity and uneven distribution of source sentences. First, their performance depends heavily on the choice of development set, which can make test-time performance unstable. Second, sentence-level translation quality is not assured, since tuning is performed at the document level rather than the sentence level. In contrast to standard global training, which learns a single weight vector, we propose novel local training methods to address these two problems: training and testing are performed in one step by locally learning a sentence-wise weight vector for each input sentence. Because each tuning step takes non-negligible time, and learning sentence-wise weights for an entire test set requires many passes of tuning, efficiency is a major challenge for local training. We propose an efficient two-phase method that makes local training practical by employing an ultraconservative update. On NIST Chinese-to-English translation tasks with both medium- and large-scale training data, our local training methods significantly outperform standard methods, with improvements of up to 2.0 BLEU points, while remaining comparable to the standard methods in efficiency.
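The abstract's "ultraconservative update" refers to the family of updates (as in MIRA) that change the weight vector as little as possible while enforcing a margin constraint between a better and a worse translation hypothesis. The sketch below is a minimal, hypothetical illustration of one such single-constraint update in its standard closed form; the feature vectors, loss value, and clip constant `C` are invented for the example and are not taken from the paper.

```python
import numpy as np

def ultraconservative_update(w, feat_good, feat_bad, loss, C=1.0):
    """MIRA-style single-constraint update: move w as little as possible
    (in Euclidean norm) so the better hypothesis outscores the worse one
    by a margin equal to its loss, clipped by the aggressiveness constant C."""
    diff = feat_good - feat_bad            # feature difference vector
    violation = loss - w.dot(diff)         # how far the margin constraint is violated
    if violation <= 0:
        return w                           # constraint already satisfied: no change
    tau = min(C, violation / diff.dot(diff))  # closed-form step size
    return w + tau * diff

# Toy example with two features: the better hypothesis is under-scored,
# so the update shifts weight toward its features.
w = ultraconservative_update(np.zeros(2),
                             np.array([1.0, 0.0]),   # features of better hypothesis
                             np.array([0.0, 1.0]),   # features of worse hypothesis
                             loss=1.0)
```

After the update the margin constraint holds exactly (the new score difference equals the loss), which is the "conservative" property: the weights move just enough, keeping each sentence-wise adjustment small and cheap.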
ACM Transactions on Asian Language Information Processing