Incorporating pre-training in long short-term memory networks for tweet classification

Shuhan Yuan, Xintao Wu, Yang Xiang

doi:10.1007/s13278-018-0530-1

Abstract

This paper presents deep learning models for tweet classification. Our approach is based on the long short-term memory (LSTM) recurrent neural network and is therefore expected to capture long-term dependencies among words. We first focus on the binary classification task. The basic model, called LSTM-TC, takes word embeddings as inputs, uses an LSTM to derive the semantic tweet representation, and applies logistic regression to predict the tweet label. Like other deep learning models, the basic LSTM-TC model requires a large amount of well-labeled training data to achieve good performance. To address this challenge, we further develop an improved model, called LSTM-TC*, that incorporates a large amount of weakly labeled data for classifying tweets. Finally, we extend the models to the multiclass classification task; the extended models are called LSTM-Multi-TC and LSTM-Multi-TC*. We present two approaches to constructing the weakly labeled data. One is based on hashtag information, and the other is based on the prediction output of a traditional classifier that does not need a large amount of well-labeled training data. Our LSTM-TC* and LSTM-Multi-TC* models first learn tweet representations from the weakly labeled data and then train the classifiers on the small amount of well-labeled data. Experimental results show that (1) the proposed methods can be successfully used for tweet classification and outperform existing state-of-the-art methods, and (2) pre-training tweet representations with weakly labeled tweets can significantly improve the accuracy of tweet classification.
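
The hashtag-based way of constructing weakly labeled data can be illustrated with a minimal sketch. This is not the authors' implementation: the hashtag-to-label map, the function names, and the rule of skipping tweets with conflicting hashtags are all assumptions made for illustration, since the abstract does not specify them.

```python
import re

# Hypothetical hashtag-to-label map (assumption; the abstract names no hashtags).
HASHTAG_LABELS = {"#examplepositive": 1, "#examplenegative": 0}

# Matches a '#' followed by one or more word characters.
HASHTAG_RE = re.compile(r"#\w+")


def weak_label(tweet, hashtag_labels=HASHTAG_LABELS):
    """Return a weak binary label for a tweet, or None if no clear signal exists."""
    tags = {tag.lower() for tag in HASHTAG_RE.findall(tweet)}
    labels = {hashtag_labels[tag] for tag in tags if tag in hashtag_labels}
    # Assumed rule: skip tweets with no known hashtag or with conflicting hashtags.
    if len(labels) != 1:
        return None
    return labels.pop()


def build_weak_dataset(tweets, hashtag_labels=HASHTAG_LABELS):
    """Keep only the tweets that receive a weak label, paired with that label."""
    dataset = []
    for tweet in tweets:
        label = weak_label(tweet, hashtag_labels)
        if label is not None:
            dataset.append((tweet, label))
    return dataset
```

Under this sketch, the resulting (tweet, label) pairs would serve as the large, noisy set used for pre-training the tweet representation, while the small well-labeled set is reserved for training the final classifier.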

Journal: Social Network Analysis and Mining
Publication Date: Aug 14, 2018
Citations: 14

Similar Papers

Incorporating Pre-Training in Long Short-Term Memory Networks for Tweets Classification
Shuhan Yuan ... Xintao Wu
01 Dec 2016

Long short-term memory recurrent neural networks for antibacterial peptide identification
Michael Youmans ... Christian Spainhour
01 Nov 2017

Kazakh and Russian Languages Identification Using Long Short-Term Memory Recurrent Neural Networks
Zhanibek Kozhirbayev ... Muslima Karabalayeva
01 Sep 2017

Bidirectional Quaternion Long Short-term Memory Recurrent Neural Networks for Speech Recognition
Titouan Parcollet ... Georges Linares
01 May 2019
