Short-text Classification Tasks Research Articles

Short text classification task is a special kind of text classification task in that the text to be classified is generally short, typically generating a sparse text representation that lacks rich semantic information. Given this shortcoming, scholars worldwide have explored improved short text classification methods based on deep learning. However, existing methods cannot effectively use concept knowledge and long-distance word dependencies. Therefore, based on graph neural networks from the perspective of text composition, we propose concept and dependencies enhanced graph convolutional networks for short text classification. First, the co-occurrence relationship between words is obtained by sliding window, the inclusion relationship between documents and words is obtained by TF-IDF, long-distance word dependencies is obtained by Stanford CoreNLP, and the association relationship between concepts in the concept graph with entities in the text is obtained through Microsoft Concept Graph. Then, a text graph is constructed for an entire text corpus based on the four relationships. Finally, the text graph is input into graph convolutional neural networks, and the category of each document node is predicted after two layers of convolution. Experimental results demonstrate that our proposed method overall best on multiple classical English text classification datasets.

Short and sparse texts such as tweets, search engine snippets, product reviews, and chat messages are abundant on the Web. Classifying such short-texts into a pre-defined set of categories is a common problem that arises in various contexts, such as sentiment classification, spam detection, and information recommendation. The fundamental problem in short-text classification is feature sparseness -- the lack of feature overlap between a trained model and a test instance to be classified. We propose ClassiNet -- a network of classifiers trained for predicting missing features in a given instance, to overcome the feature sparseness problem. Using a set of unlabeled training instances, we first learn binary classifiers as feature predictors for predicting whether a particular feature occurs in a given instance. Next, each feature predictor is represented as a vertex v i in the ClassiNet, where a one-to-one correspondence exists between feature predictors and vertices. The weight of the directed edge e ij connecting a vertex v i to a vertex v j represents the conditional probability that given v i exists in an instance, v j also exists in the same instance. We show that ClassiNets generalize word co-occurrence graphs by considering implicit co-occurrences between features. We extract numerous features from the trained ClassiNet to overcome feature sparseness. In particular, for a given instance x , we find similar features from ClassiNet that did not appear in x , and append those features in the representation of x . Moreover, we propose a method based on graph propagation to find features that are indirectly related to a given short-text. We evaluate ClassiNets on several benchmark datasets for short-text classification. Our experimental results show that by using ClassiNet, we can statistically significantly improve the accuracy in short-text classification tasks, without having to use any external resources such as thesauri for finding related features.

Short-text Classification Tasks Research Articles

Articles published on Short-text Classification Tasks

A head-to-head attention with prompt text augmentation for text classification

Concept and dependencies enhanced graph convolutional networks for short text classification

A Fine-grained Chinese Short Text Classification Method Based on Capsule Networks

Improving Chinese Word Representation with Conceptual Semantics

ClassiNet -- Predicting Missing Features for Short-Text Classification

Verbal aggression detection on Twitter comments: convolutional neural network for short-text sentiment analysis

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Short-text Classification Tasks Research Articles

Articles published on Short-text Classification Tasks

A head-to-head attention with prompt text augmentation for text classification

Concept and dependencies enhanced graph convolutional networks for short text classification

A Fine-grained Chinese Short Text Classification Method Based on Capsule Networks

Improving Chinese Word Representation with Conceptual Semantics

ClassiNet -- Predicting Missing Features for Short-Text Classification

Verbal aggression detection on Twitter comments: convolutional neural network for short-text sentiment analysis