Abstract

This article uses semi-supervised Expectation Maximization (EM) to learn lexico-syntactic dependencies, i.e. associations between words and the structures that occur with them. Because of the Zipfian distribution of words in language, such dependencies are extremely sparse in labelled data, and unlabelled data are the only practical source for learning them. Specifically, we learn sparse lexical parameters of a generative parsing model (a Probabilistic Context-Free Grammar, PCFG) that is initially estimated over the Penn Treebank. Our lexical parameters are similar to supertags: they are fine-grained and encode complex structural information at the pre-terminal level. Our goal is to use unlabelled data to learn these parameters for words that are rare or unseen in the labelled data. We obtain large error reductions (up to 17.5%) in parsing ambiguous structures associated with unseen verbs, the most important case of learning lexico-structural dependencies, resulting in a statistically significant improvement in the labelled bracketing score of the treebank PCFG. Our semi-supervised method incorporates structural and lexical priors from the labelled data to guide estimation from unlabelled data, and is the first successful use of semi-supervised EM to improve a generative structured model already trained on large labelled data. The method scales well to larger amounts of unlabelled data, and it also gives substantial error reductions (up to 11.5%) for models trained on smaller amounts of labelled data, making it relevant to low-resource languages with small treebanks as well.
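As a rough illustration of the estimation step described above, the sketch below shows a MAP-style M-step in which counts from the labelled treebank act as a prior (pseudo-counts) added to expected rule counts gathered from unlabelled data. This is a minimal sketch under stated assumptions, not the paper's implementation: it assumes the lexical parameters are multinomial PCFG rule probabilities P(rhs | lhs), and it abstracts the E-step (expected counts over unlabelled text, typically computed with the inside-outside algorithm) into the expected_counts argument. All names, rules, and counts here are hypothetical.

    from collections import defaultdict

    def m_step(labelled_counts, expected_counts, prior_weight=1.0):
        """MAP re-estimation of P(rhs | lhs): labelled-data counts act as
        Dirichlet-style pseudo-counts added to the EM expected counts."""
        combined = defaultdict(float)
        for rule, count in labelled_counts.items():
            combined[rule] += prior_weight * count   # prior from the treebank
        for rule, count in expected_counts.items():
            combined[rule] += count                  # E-step expectations
        totals = defaultdict(float)
        for (lhs, _rhs), count in combined.items():
            totals[lhs] += count                     # normalizer per left-hand side
        return {rule: count / totals[rule[0]] for rule, count in combined.items()}

    # Toy usage: rules are (lhs, rhs) pairs; counts are illustrative only.
    labelled = {("VP", ("V-trans", "NP")): 50.0, ("VP", ("V-intrans",)): 10.0}
    expected = {("VP", ("V-trans", "NP")): 2.0, ("VP", ("V-intrans",)): 18.0}
    print(m_step(labelled, expected))

Raising prior_weight keeps the re-estimated model close to the treebank estimates, while lowering it lets the unlabelled data dominate; for the sparse lexical parameters of rare and unseen words, the prior contributes little and the unlabelled counts do most of the work.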
