Abstract
Neural text classification models typically treat output labels as categorical variables that lack description and semantics. This forces their parametrization to depend on the label set size, so they cannot scale to large label sets or generalize to unseen ones. Existing joint input-label text models overcome these issues by exploiting label descriptions, but they are unable to capture complex label relationships, have rigid parametrization, and their gains on unseen labels often come at the expense of weak performance on the labels seen during training. In this paper, we propose a new input-label model that generalizes over previous such models, addresses their limitations, and does not compromise performance on seen labels. The model consists of a joint nonlinear input-label embedding with controllable capacity and a joint-space-dependent classification unit that is trained with cross-entropy loss to optimize classification performance. We evaluate models on full-resource and low- or zero-resource text classification of multilingual news and biomedical text with a large label set. In both scenarios, our model outperforms monolingual and multilingual models that do not leverage label semantics, as well as previous joint input-label space models.
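As a rough illustration of the kind of output layer the abstract describes, the following sketch (PyTorch; the class and parameter names such as GILEOutputLayer, proj_input, and joint_dim are illustrative assumptions, not the paper's exact formulation) projects the input representation and each label-description embedding into a shared nonlinear joint space and scores every (input, label) pair with a small classification unit, so the number of output-layer parameters does not grow with the label set size.

```python
import torch
import torch.nn as nn

class GILEOutputLayer(nn.Module):
    """Hypothetical sketch of a generalized input-label embedding output layer.

    The input representation h and each label-description embedding are mapped
    into a shared joint space with a nonlinearity; a small joint-space-dependent
    classification unit then produces one score per label.
    """

    def __init__(self, input_dim, label_dim, joint_dim):
        super().__init__()
        self.proj_input = nn.Linear(input_dim, joint_dim)   # input -> joint space
        self.proj_label = nn.Linear(label_dim, joint_dim)   # label -> joint space
        self.scorer = nn.Linear(joint_dim, 1)               # classification unit

    def forward(self, h, label_embeddings):
        # h: (batch, input_dim); label_embeddings: (num_labels, label_dim)
        g_in = torch.tanh(self.proj_input(h))                  # (batch, joint_dim)
        g_lab = torch.tanh(self.proj_label(label_embeddings))  # (num_labels, joint_dim)
        # multiplicative interaction in the joint space, scored per label
        joint = g_in.unsqueeze(1) * g_lab.unsqueeze(0)          # (batch, num_labels, joint_dim)
        logits = self.scorer(joint).squeeze(-1)                 # (batch, num_labels)
        return logits  # feed to sigmoid/softmax and a cross-entropy loss
```

Because the label set only enters through the label embeddings passed at the forward call, such a layer can in principle score labels unseen during training, which is the property the abstract emphasizes.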
Highlights
Text classification is a fundamental NLP task with numerous real-world applications such as topic recognition (Tang et al., 2015; Yang et al., 2016), sentiment analysis (Pang and Lee, 2005; Yang et al., 2016), and question answering (Chen et al., 2015; Kumar et al., 2015).
To encode the input text, we focus on hierarchical attention networks (HANs), which are competitive for monolingual (Yang et al., 2016) and multilingual text classification (Pappas and Popescu-Belis, 2017).
GILE-word-level attention neural network (WAN) outperforms the WSABIE+ and AiTextML variants by a large margin in both cases, for example by +7.75 and +11.61 points on seen labels and by +12.58 and +10.29 points in terms of average precision on unseen labels, respectively.
Summary
Text classification is a fundamental NLP task with numerous real-world applications such as topic recognition (Tang et al., 2015; Yang et al., 2016), sentiment analysis (Pang and Lee, 2005; Yang et al., 2016), and question answering (Chen et al., 2015; Kumar et al., 2015). Previous work has leveraged knowledge from the label texts through a joint input-label space, initially for image classification (Weston et al., 2011; Mensink et al., 2012; Frome et al., 2013; Socher et al., 2013). Such models generalize to labels both seen and unseen during training and scale well to very large label sets. The input text is encoded with a hierarchical attention network: the word level consists of an encoder network g_w and an attention network a_w, while the sentence level includes its own encoder and attention network.
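To make that hierarchical structure concrete, here is a minimal sketch of a HAN-style document encoder (PyTorch; names such as AttentionPool and HierarchicalAttentionEncoder, and the choice of GRUs, are assumptions for illustration, not the authors' code): a word-level encoder and attention network build sentence vectors, and a sentence-level encoder and attention network pool them into a document vector for the classifier.

```python
import torch
import torch.nn as nn

class AttentionPool(nn.Module):
    """Additive attention pooling over a sequence of hidden states (illustrative helper)."""

    def __init__(self, hidden_dim):
        super().__init__()
        self.proj = nn.Linear(hidden_dim, hidden_dim)
        self.context = nn.Linear(hidden_dim, 1, bias=False)  # attention context vector

    def forward(self, states):
        # states: (batch, seq_len, hidden_dim)
        weights = torch.softmax(self.context(torch.tanh(self.proj(states))), dim=1)
        return (weights * states).sum(dim=1)  # weighted sum -> (batch, hidden_dim)

class HierarchicalAttentionEncoder(nn.Module):
    """Sketch of a HAN document encoder: word-level encoder + attention produce
    sentence vectors; sentence-level encoder + attention produce the document vector."""

    def __init__(self, emb_dim, hidden_dim):
        super().__init__()
        self.word_encoder = nn.GRU(emb_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.word_attention = AttentionPool(2 * hidden_dim)
        self.sent_encoder = nn.GRU(2 * hidden_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.sent_attention = AttentionPool(2 * hidden_dim)

    def forward(self, word_embeddings):
        # word_embeddings: (batch, num_sents, num_words, emb_dim)
        b, s, w, e = word_embeddings.shape
        words, _ = self.word_encoder(word_embeddings.view(b * s, w, e))
        sent_vectors = self.word_attention(words).view(b, s, -1)  # one vector per sentence
        sents, _ = self.sent_encoder(sent_vectors)
        return self.sent_attention(sents)  # document vector: (batch, 2 * hidden_dim)
```

The document vector produced here would then be scored against label embeddings by an output layer such as the joint input-label sketch given after the abstract.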