Abstract

The performance of text classification has improved tremendously with intelligently engineered neural models, especially those that inject categorical metadata as additional information, e.g., using user/product information for sentiment classification. This information has been used to modify parts of the model (e.g., word embeddings, attention mechanisms) so that results can be customized according to the metadata. We observe that current representation methods for categorical metadata, which are devised for human consumption, are not as effective as claimed in popular classification methods; they are outperformed even by simple concatenation of categorical features in the final layer of the sentence encoder. We conjecture that categorical features are harder to represent for machine use because the available context only indirectly describes the category, and even such context is often scarce (for tail categories). To this end, we propose using basis vectors to effectively incorporate categorical metadata in various parts of a neural model. This additionally reduces the number of parameters dramatically, especially when the number of categorical features is large. Extensive experiments on datasets with different properties show that our method represents categorical metadata more effectively, customizes parts of the model (including previously unexplored ones), and greatly improves model performance.
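To make the basis-vector idea concrete, here is a minimal PyTorch sketch. All names, dimensions, and the softmax mixing scheme are our illustrative assumptions, not the paper's exact formulation: each category learns only a small coefficient vector that mixes a few shared basis weight matrices, instead of learning a full weight matrix of its own.

```python
import torch
import torch.nn as nn

class BasisCustomizedLinear(nn.Module):
    """Hypothetical sketch of the basis-vector idea: instead of a separate
    weight matrix per category (num_categories x in_dim x out_dim parameters),
    learn a few shared basis matrices plus a small per-category coefficient
    vector that mixes them."""

    def __init__(self, num_categories, num_bases, in_dim, out_dim):
        super().__init__()
        # Shared basis weight matrices: (num_bases, in_dim, out_dim)
        self.bases = nn.Parameter(torch.randn(num_bases, in_dim, out_dim) * 0.01)
        # Per-category mixing coefficients: (num_categories, num_bases)
        self.coeffs = nn.Embedding(num_categories, num_bases)

    def forward(self, x, category_ids):
        # x: (B, in_dim); category_ids: (B,)
        # Softmax over bases gives each category a convex mixture.
        mix = torch.softmax(self.coeffs(category_ids), dim=-1)   # (B, num_bases)
        # Category-specific weight: weighted sum of the shared bases.
        weight = torch.einsum('bk,kio->bio', mix, self.bases)    # (B, in_dim, out_dim)
        return torch.einsum('bi,bio->bo', x, weight)             # (B, out_dim)
```

Under these assumptions, the parameter count drops from num_categories × in_dim × out_dim to num_bases × in_dim × out_dim + num_categories × num_bases, which is the dramatic reduction the abstract refers to when the number of categories is large.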

Highlights

  • Text classification is the backbone of most NLP tasks: review classification in sentiment analysis (Pang et al., 2002), paper classification in scientific data discovery (Sebastiani, 2002), and question classification in question answering (Li and Roth, 2002), to name a few

  • Metadata is generated for human understanding, and we claim that these categories need to be carefully represented for machine use

  • We present five levels of Customized Bidirectional Long Short-Term Memory (BiLSTM), which differ in where we inject the categorical features, listed here from the highest to the lowest level of dependency between text and categories (see the sketch after this list)
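The five injection sites themselves are not enumerated in this summary, so the sketch below (a hypothetical PyTorch module; class and argument names are ours) only contrasts two extremes: injecting the category embedding at the word-embedding level versus concatenating it at the encoder output, the simple final-layer baseline the abstract mentions.

```python
import torch
import torch.nn as nn

class CustomizedBiLSTM(nn.Module):
    """Illustrative sketch (names and the choice of two injection sites are
    assumptions): a BiLSTM classifier where a category embedding is injected
    either at the word-embedding level or at the encoder output."""

    def __init__(self, vocab_size, num_categories, emb_dim=100,
                 cat_dim=64, hidden=128, num_classes=5, level='embedding'):
        super().__init__()
        self.level = level
        self.word_emb = nn.Embedding(vocab_size, emb_dim)
        self.cat_emb = nn.Embedding(num_categories, cat_dim)
        lstm_in = emb_dim + (cat_dim if level == 'embedding' else 0)
        self.encoder = nn.LSTM(lstm_in, hidden, bidirectional=True,
                               batch_first=True)
        clf_in = 2 * hidden + (cat_dim if level == 'output' else 0)
        self.classifier = nn.Linear(clf_in, num_classes)

    def forward(self, tokens, category_ids):
        w = self.word_emb(tokens)                       # (B, T, emb_dim)
        c = self.cat_emb(category_ids)                  # (B, cat_dim)
        if self.level == 'embedding':
            # Low-level injection: append the category vector to every word.
            w = torch.cat([w, c.unsqueeze(1).expand(-1, w.size(1), -1)], dim=-1)
        h, _ = self.encoder(w)                          # (B, T, 2*hidden)
        doc = h.mean(dim=1)                             # simple mean pooling
        if self.level == 'output':
            # High-level injection: concatenate at the final layer only.
            doc = torch.cat([doc, c], dim=-1)
        return self.classifier(doc)
```

The earlier the injection, the more the text representation itself can depend on the category, which is the dependency ordering the highlight describes.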


Summary

Introduction

Text classification is the backbone of most NLP tasks: review classification in sentiment analysis (Pang et al., 2002), paper classification in scientific data discovery (Sebastiani, 2002), and question classification in question answering (Li and Roth, 2002), to name a few. We are inspired by the advancement of neural models that incorporate categorical information "as is" and inject it into various parts of the model, such as the word embeddings (Tang et al., 2015), the attention mechanism (Chen et al., 2016; Amplayo et al., 2018a), and memory networks (Dou, 2017). These methods, in theory, make use of combined textual and categorical features, which makes them more powerful than disconnected features.
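As one concrete example of such injection, here is a hedged sketch of a category-aware attention mechanism in the spirit of Chen et al. (2016): the attention scores over encoder states depend on a user/product embedding, so different users attend to different words. The module name, dimensions, and scoring function are illustrative assumptions.

```python
import torch
import torch.nn as nn

class CategoryAwareAttention(nn.Module):
    """Illustrative sketch: attention over encoder states conditioned on a
    category (e.g., user or product) embedding."""

    def __init__(self, hidden_dim, cat_dim):
        super().__init__()
        self.proj = nn.Linear(hidden_dim + cat_dim, hidden_dim)
        self.score = nn.Linear(hidden_dim, 1, bias=False)

    def forward(self, states, cat_vec):
        # states: (B, T, hidden_dim); cat_vec: (B, cat_dim)
        T = states.size(1)
        c = cat_vec.unsqueeze(1).expand(-1, T, -1)           # (B, T, cat_dim)
        e = self.score(torch.tanh(self.proj(torch.cat([states, c], dim=-1))))
        alpha = torch.softmax(e, dim=1)                      # (B, T, 1)
        return (alpha * states).sum(dim=1)                   # (B, hidden_dim)
```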

