Discriminative Word Alignment by Linear Modeling

Yang Liu,Shouxun Lin,Qun Liu

doi:10.1162/coli_a_00001

Abstract

Word alignment plays an important role in many NLP tasks as it indicates the correspondence between words in a parallel text. Although widely used to align large bilingual corpora, generative models are hard to extend to incorporate arbitrary useful linguistic information. This article presents a discriminative framework for word alignment based on a linear model. Within this framework, all knowledge sources are treated as feature functions, which depend on a source language sentence, a target language sentence, and the alignment between them. We describe a number of features that could produce symmetric alignments. Our model is easy to extend and can be optimized with respect to evaluation metrics directly. The model achieves state-of-the-art alignment quality on three word alignment shared tasks for five language pairs with varying divergence and richness of resources. We further show that our approach improves translation performance for various statistical machine translation systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computational Linguistics	Publication Date: Sep 1, 2010
Citations: 49	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Discriminative Word Alignment by Linear Modeling

Abstract

Talk to us

Similar Papers

More From: Computational Linguistics

Lead the way for us

Similar Papers

Integrating Word Embeddings into IBM Word Alignment Models
Anh-Cuong Le ... Dao Bao Linh
-
Anh-Cuong Le, et. al.Anh-Cuong Le ... Dao Bao Linh
01 Nov 2018
01 Nov 2018

A Hybrid Approach for Word Alignment with Statistical Modeling and Chunker
Jyoti Srivastava ... Sudip Sanyal
-
Jyoti Srivastava, et. al.Jyoti Srivastava ... Sudip Sanyal
01 Jan 2015
01 Jan 2015

What types of word alignment improve statistical machine translation?
Patrik Lambert ... Yanjun Ma
Machine Translation | VOL. 26
Patrik Lambert, et. al.Patrik Lambert ... Yanjun Ma
10 Mar 2012
Machine Translation | VOL. 26

Preordering using a Target-Language Parser via Cross-Language Syntactic Projection for Statistical Machine Translation
Isao Goto ... Eiichiro Sumita
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 14
Isao Goto, et. al.Isao Goto ... Eiichiro Sumita
12 Jun 2015
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discriminative Word Alignment by Linear Modeling

Abstract

Talk to us

Similar Papers

More From: Computational Linguistics