Learning Tractable Word Alignment Models with Complex Constraints

João V Graça, Kuzman Ganchev, Ben Taskar

doi:10.1162/coli_a_00007

Abstract

Word-level alignment of bilingual text is a critical resource for a growing variety of tasks. Probabilistic models for word alignment present a fundamental trade-off between richness of captured constraints and correlations versus efficiency and tractability of inference. In this article, we use the Posterior Regularization framework (Graça, Ganchev, and Taskar 2007) to incorporate complex constraints into probabilistic models during learning without changing the efficiency of the underlying model. We focus on the simple and tractable hidden Markov model, and present an efficient learning algorithm for incorporating approximate bijectivity and symmetry constraints. Models estimated with these constraints produce a significant boost in performance as measured by both precision and recall of manually annotated alignments for six language pairs. We also report experiments on two different tasks where word alignments are required: phrase-based machine translation and syntax transfer, and show promising improvements over standard methods.
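
As a rough illustration of what an approximate bijectivity constraint on posteriors might look like, the sketch below checks how far a table of alignment posteriors is from using each hidden state at most once in expectation. The abstract does not give the formulation, so the bound of 1, the function names, and the toy posterior table are all assumptions made for illustration, not the authors' implementation.

```python
def expected_state_usage(posteriors):
    """For each hidden state, sum the posterior mass it receives across all positions.

    posteriors[j][i] is assumed to hold the posterior probability that
    observed position j is aligned to hidden state i.
    """
    n_states = len(posteriors[0])
    return [sum(row[i] for row in posteriors) for i in range(n_states)]


def bijectivity_violations(posteriors, limit=1.0):
    """Return, per hidden state, how much its expected usage exceeds the limit."""
    return [max(0.0, usage - limit) for usage in expected_state_usage(posteriors)]


# Hypothetical posteriors for 3 observed words over 3 hidden states.
# Each row sums to 1; state 0 attracts most of the mass from two words.
posteriors = [
    [0.9, 0.1, 0.0],
    [0.8, 0.1, 0.1],
    [0.0, 0.2, 0.8],
]

usage = expected_state_usage(posteriors)
violations = bijectivity_violations(posteriors)
print("expected usage:", usage)
print("violations:", violations)
```

In this toy table only state 0 exceeds the assumed limit. A constrained learner in the spirit of the abstract would penalize or project away that excess during training rather than merely report it.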

Journal: Computational Linguistics	Publication Date: Sep 1, 2010
Citations: 36

Similar Papers

HMM Word and Phrase Alignment for Statistical Machine Translation
Yonggang Deng ... W Byrne
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 16
01 Mar 2008

A model for fine-grained alignment of multilingual texts
Lea Cyrus ... Hendrik Feddes
01 Jan 2004

A Bidirectional Transformer Based Alignment Model for Unsupervised Word Alignment
01 Aug 2021

A Systematic Comparison between Various Statistical Alignment Models for Statistical English-Vietnamese Phrase-Based Translation
Cuong Hoang ... Son Bao Pham
01 Aug 2012