Linguistic-Relationships-Based Approach for Improving Word Alignment

Phuoc Tran,Long H B Nguyen,Tan Le,Dien Dinh

doi:10.1145/3133323

Abstract

The unsupervised word alignments (such as GIZA++) are widely used in the phrase-based statistical machine translation. The quality of the model is proportional to the size and the quality of the bilingual corpus. However, for low-resource language pairs such as Chinese and Vietnamese, a result of unsupervised word alignment sometimes is of low quality due to the sparse data. In addition, this model does not take advantage of the linguistic relationships to improve performance of word alignment. Chinese and Vietnamese have the same language type and have close linguistic relationships. In this article, we integrate the characteristics of linguistic relationships into the word alignment model to enhance the quality of Chinese-Vietnamese word alignment. These linguistic relationships are Sino-Vietnamese and content word. The experimental results showed that our method improved the performance of word alignment as well as the quality of machine translation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Linguistic-Relationships-Based Approach for Improving Word Alignment

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian and Low-Resource Language Information Processing	Publication Date: Oct 14, 2017
Citations: 4

Similar Papers

Hybrid Word Alignment
Santanu Pal ... Sudip Kumar Naskar
-
Santanu Pal, et. al.Santanu Pal ... Sudip Kumar Naskar
01 Jan 2015
01 Jan 2015

Evaluation of Source to Target and Target to Source Word Alignment for English to Hindi
Arun R Babhulgaonkar ... Shefali P Sonavane
-
Arun R Babhulgaonkar, et. al.Arun R Babhulgaonkar ... Shefali P Sonavane
01 Jan 2020
01 Jan 2020

Exploiting Parallel Treebanks to Improve Phrase-Based Statistical Machine Translation
John Tinsley ... Mary Hearne
-
John Tinsley, et. al.John Tinsley ... Mary Hearne
01 Jan 2009
01 Jan 2009

A Bilingual Word Alignment Method of Chinese-English based on Recurrent Neural Network
Jingsong Xiang ... Sheng Huang
-
Jingsong Xiang, et. al.Jingsong Xiang ... Sheng Huang
01 Nov 2019
01 Nov 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Linguistic-Relationships-Based Approach for Improving Word Alignment

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian and Low-Resource Language Information Processing