Classification-Based Approach for Hybridizing Statistical and Rule-Based Machine Translation

Eun-Jin Park, Oh-Woog Kwon, Young-Kil Kim, Kangil Kim

doi:10.4218/etrij.15.0114.1017

Abstract

In this paper, we propose a classification-based approach for hybridizing statistical machine translation and rule-based machine translation. The hybridization quality depends on both the training dataset used to train our proposed classifier and our feature extraction method. To create such a training dataset, a previous approach used automatic evaluation metrics to compare the outputs of a set of component machine translation (MT) systems and determine which gave the more accurate translation. The most accurate translation was then labeled with the MT system that produced it. In this previous approach, when the metric scores were low, it was highly uncertain which of the component MT systems was actually producing the better translation. To reduce such uncertainty or error in classification, we propose an alternative approach to labeling: a cut-off method. In our experiments, our proposed classifier with the cut-off method achieved a translation accuracy of 81.5%, a 5.0% improvement over existing methods.
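
The two labeling strategies described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not say how the cut-off is applied, so the sketch assumes that a sentence is left unlabeled (and dropped from the training set) when even the better metric score falls below a threshold. The function names, the "SMT"/"RBMT" label strings, the example scores, and the threshold value are all hypothetical.

```python
from typing import List, Optional, Tuple


def comparative_label(smt_score: float, rbmt_score: float) -> str:
    """Comparative labeling: pick the system whose output scored higher.

    Ties go to RBMT here; that choice is arbitrary and only for illustration.
    """
    return "SMT" if smt_score > rbmt_score else "RBMT"


def cutoff_label(smt_score: float, rbmt_score: float, cutoff: float) -> Optional[str]:
    """Cut-off labeling (assumed behavior): skip sentences whose best score
    is below `cutoff`, since a low-score comparison is unreliable."""
    if max(smt_score, rbmt_score) < cutoff:
        return None
    return comparative_label(smt_score, rbmt_score)


# Hypothetical (SMT score, RBMT score) pairs from an automatic metric.
scored: List[Tuple[float, float]] = [(0.62, 0.41), (0.08, 0.11), (0.30, 0.55)]

labels = [cutoff_label(s, r, cutoff=0.2) for s, r in scored]
# Keep only the sentences that received a label.
training_labels = [label for label in labels if label is not None]
```

Under this assumption, the second pair is excluded because both scores are below the threshold, so only confidently labeled sentences would be used to train the classifier.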

Journal: ETRI Journal
Publication Date: Jun 1, 2015
Citations: 5

Similar Papers

Training, Enhancing, Evaluating and Using MT Systems with Comparable Data
Bogdan Babych ... Mārcis Pinnis
01 Jan 2019

Handling Multi-word Expressions Without Explicit Linguistic Rules in an MT System
Akshar Bharati ... Rajeev Sangal
01 Jan 2004

Statistical vs. Rule-Based Machine Translation: A Comparative Study on Indian Languages
S Sreelekha ... Pushpak Bhattacharyya
28 Dec 2017

Tighter integration of rule-based and statistical MT in serial system combination
Nicola Ueffing ... Evgeny Matusov
01 Jan 2008
