Inter-, Intra-, and Extra-Chunk Pre-Ordering for Statistical Japanese-to-English Machine Translation

Chenchen Ding, Mikio Yamamoto, Hirona Touji, Keisuke Sakanushi

doi:10.1145/2818381

https://doi.org/10.1145/2818381

Abstract

A rule-based pre-ordering approach is proposed for statistical Japanese-to-English machine translation using the dependency structure of source-side sentences. To resolve the reordering problem, a Japanese sentence is pre-ordered into an English-like order at the morpheme level before it is passed to a statistical machine translation system, in both the training and decoding phases. In this article, extra-chunk pre-ordering of morphemes is proposed, which allows Japanese functional morphemes to move across chunk boundaries. This contrasts with the intra-chunk reordering used in previous approaches, which restricts morpheme reordering to within a chunk. Linguistically oriented discussions show that correct pre-ordering cannot be realized without extra-chunk movement of morphemes. The proposed approach is compared with five rule-based pre-ordering approaches designed for Japanese-to-English translation and with a language-independent statistical pre-ordering approach, on a standard patent dataset and on a news dataset obtained by crawling Internet news sites. Two state-of-the-art statistical machine translation systems, one phrase-based and the other hierarchical phrase-based, are used in the experiments. Experimental results show that the proposed approach outperforms the compared approaches on automatic reordering measures (Kendall’s τ, Spearman’s ρ, fuzzy reordering score, and test-set RIBES) and on test-set BLEU, an automatic measure of translation precision.
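
To make two of the reordering measures named in the abstract concrete, here is a minimal sketch of Kendall's τ and Spearman's ρ computed over a word-order permutation. It uses the textbook, tie-free definitions, which give values in [-1, 1]. The abstract does not state which normalization the article uses, so the formulas, the function names `kendall_tau` and `spearman_rho`, and the input convention (`perm[i]` is the reference position of the i-th token in the pre-ordered sentence) are all assumptions made for illustration.

```python
from itertools import combinations


def kendall_tau(perm):
    """Kendall's tau of a tie-free permutation, in [-1, 1].

    perm[i] is the (assumed) reference position of the i-th token
    in the pre-ordered sentence. Identity order gives 1.0.
    """
    n = len(perm)
    if n < 2:
        return 1.0  # a single token is trivially in order
    pairs = list(combinations(range(n), 2))
    # A pair is concordant when the two tokens keep their relative order.
    concordant = sum(1 for i, j in pairs if perm[i] < perm[j])
    discordant = len(pairs) - concordant
    return (concordant - discordant) / len(pairs)


def spearman_rho(perm):
    """Spearman's rho of a tie-free permutation, in [-1, 1]."""
    n = len(perm)
    if n < 2:
        return 1.0
    # Sum of squared differences between actual and reference positions.
    d2 = sum((ref - pos) ** 2 for pos, ref in enumerate(perm))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


if __name__ == "__main__":
    swapped = [0, 2, 1, 3]  # one adjacent pair out of order
    print(kendall_tau(swapped), spearman_rho(swapped))
```

Under these definitions a perfectly pre-ordered sentence scores 1.0 on both measures and a fully reversed one scores -1.0, so higher values indicate a source order closer to the reference order.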

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing
Publication Date: Jan 9, 2016
Citations: 9

Similar Papers

Hybrid data-driven models of machine translation
Declan Groves ... Andy Way
Machine Translation | VOL. 19
02 Nov 2006

Using Statistical Machine Translation to Grade Training Data
Andrew Finch ... Eiichiro Sumita
01 Dec 2008

Training, Enhancing, Evaluating and Using MT Systems with Comparable Data
Bogdan Babych ... Andrejs Vasiļjevs
01 Jan 2019

Statistical vs. Rule-Based Machine Translation: A Comparative Study on Indian Languages
S Sreelekha ... Pushpak Bhattacharyya
28 Dec 2017
