Generalised Brown Clustering and Roll-Up Feature Generation

Leon Derczynski,Sean Chester

doi:10.1609/aaai.v30i1.10190

Abstract

Brown clustering is an established technique, used in hundreds of computational linguistics papers each year, to group word types that have similar distributional information. It is unsupervised and can be used to create powerful word representations for machine learning. Despite its improbable success relative to more complex methods, few have investigated whether Brown clustering has really been applied optimally. In this paper, we present a subtle but profound generalisation of Brown clustering to improve the overall quality by decoupling the number of output classes from the computational active set size. Moreover, the generalisation permits a novel approach to feature selection from Brown clusters: We show that the standard approach of shearing the Brown clustering output tree at arbitrary bitlengths is lossy and that features should be chosen insead by rolling up Generalised Brown hierarchies. The generalisation and corresponding feature generation is more principled, challenging the way Brown clustering is currently understood and applied.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generalised Brown Clustering and Roll-Up Feature Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Feb 21, 2016
Citations: 11

Similar Papers

Large-scaleword representation features for improved spoken language understanding
Jun Zhang ... Timothy J Hazen
-
Jun Zhang, et. al.Jun Zhang ... Timothy J Hazen
01 Apr 2015
01 Apr 2015

Quantifying the morphosyntactic content of Brown Clusters
Manuel Ciosici ... Leon Derczynski
-
Manuel Ciosici, et. al.Manuel Ciosici ... Leon Derczynski
01 Jan 2019
01 Jan 2019

A comparison of conditional random fields and structured support vector machines for chemical entity recognition in biomedical literature.
Buzhou Tang ... Min Jiang
Journal of Cheminformatics | VOL. 7
Buzhou Tang, et. al.Buzhou Tang ... Min Jiang
19 Jan 2015
Journal of Cheminformatics | VOL. 7

Tailoring Continuous Word Representations for Dependency Parsing
Mohit Bansal ... Kevin Gimpel
-
Mohit Bansal, et. al.Mohit Bansal ... Kevin Gimpel
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generalised Brown Clustering and Roll-Up Feature Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence