RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation

Víctor M Sánchez-Cartagena,Juan Antonio Pérez-Ortiz,Felipe Sánchez-Martínez

doi:10.1515/pralin-2016-0018

Víctor M Sánchez-Cartagena, Juan Antonio Pérez-Ortiz + Show 1 more

Open Access

https://doi.org/10.1515/pralin-2016-0018

Copy DOI

Abstract

Abstract This paper presents ruLearn, an open-source toolkit for the automatic inference of rules for shallow-transfer machine translation from scarce parallel corpora and morphological dictionaries. ruLearn will make rule-based machine translation a very appealing alternative for under-resourced language pairs because it avoids the need for human experts to handcraft transfer rules and requires, in contrast to statistical machine translation, a small amount of parallel corpora (a few hundred parallel sentences proved to be sufficient). The inference algorithm implemented by ruLearn has been recently published by the same authors in Computer Speech & Language (volume 32). It is able to produce rules whose translation quality is similar to that obtained by using hand-crafted rules. ruLearn generates rules that are ready for their use in the Apertium platform, although they can be easily adapted to other platforms. When the rules produced by ruLearn are used together with a hybridisation strategy for integrating linguistic resources from shallow-transfer rule-based machine translation into phrase-based statistical machine translation (published by the same authors in Journal of Artificial Intelligence Research, volume 55), they help to mitigate data sparseness. This paper also shows how to use ruLearn and describes its implementation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: The Prague Bulletin of Mathematical Linguistics	Publication Date: Oct 1, 2016
Citations: 1	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation

Abstract

Talk to us

Similar Papers

More From: The Prague Bulletin of Mathematical Linguistics

Lead the way for us

Similar Papers

A generalised alignment template formalism and its application to the inference of shallow-transfer machine translation rules from scarce bilingual corpora
Víctor M Sánchez-Cartagena ... Felipe Sánchez-Martínez
Computer Speech & Language | VOL. 32
Víctor M Sánchez-Cartagena, et. al.Víctor M Sánchez-Cartagena ... Felipe Sánchez-Martínez
07 Nov 2014
Computer Speech & Language | VOL. 32

Tighter integration of rule-based and statistical MT in serial system combination
Nicola Ueffing ... Evgeny Matusov
-
Nicola Ueffing, et. al.Nicola Ueffing ... Evgeny Matusov
01 Jan 2008
01 Jan 2008

Integrating Rules and Dictionaries from Shallow-Transfer Machine Translation into Phrase-Based Statistical Machine Translation
Víctor M Sánchez-Cartagena ... Juan Antonio Pérez-Ortiz
Journal of Artificial Intelligence Research | VOL. 55
Víctor M Sánchez-Cartagena, et. al.Víctor M Sánchez-Cartagena ... Juan Antonio Pérez-Ortiz
13 Jan 2016
Journal of Artificial Intelligence Research | VOL. 55

Statistical vs. Rule-Based Machine Translation: A Comparative Study on Indian Languages
S Sreelekha ... Pushpak Bhattacharyya
-
S Sreelekha, et. al.S Sreelekha ... Pushpak Bhattacharyya
28 Dec 2017
28 Dec 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RuLearn: an Open-source Toolkit for the Automatic Inference of Shallow-transfer Rules for Machine Translation

Abstract

Talk to us

Similar Papers

More From: The Prague Bulletin of Mathematical Linguistics