Incorporating Pre-Training in Long Short-Term Memory Networks for Tweets Classification

Shuhan Yuan,Yang Xiang,Xintao Wu

doi:10.1109/icdm.2016.0181

Abstract

The paper presents deep learning models for tweets binary classification. Our approach is based on the Long Short-Term Memory (LSTM) recurrent neural network and hence expects to be able to capture long-term dependencies among words. We develop two models for tweets classification. The basic model, called LSTM-TC, takes word embeddings as input, uses the LSTM layer to derive semantic tweet representation, and applies logistic regression to predict tweet label. The basic LSTM-TC model, like other deep learning models, requires a large amount of well-labeled training data to achieve good performance. To address this challenge, we further develop an improved model, called LSTM-TC*, that incorporates a large amount of weakly-labeled data for classifying tweets. We present two approaches of constructing the weakly-labeled data. One is based on hashtag information and the other is based on the prediction output of some traditional classifier that does not need a large amount of well-labeled training data. Our LSTM-TC* model first learns tweet representation based on the weakly-labeled data, and then trains the logistic regression classifier based on the small amount of well-labeled data. Experimental results show that: (1) the proposed method can be successfully used for tweets classification and outperform existing state-of-the-art methods, (2) pre-training tweet representation, which utilizes weakly-labeled tweets, can significantly improve the accuracy of tweets classification.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Incorporating Pre-Training in Long Short-Term Memory Networks for Tweets Classification

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Incorporating pre-training in long short-term memory networks for tweet classification
Shuhan Yuan ... Yang Xiang
Social Network Analysis and Mining | VOL. 8
Shuhan Yuan, et. al.Shuhan Yuan ... Yang Xiang
14 Aug 2018
Social Network Analysis and Mining | VOL. 8

Kazakh and Russian Languages Identification Using Long Short-Term Memory Recurrent Neural Networks
Zhanibek Kozhirbayev ... Muslima Karabalayeva
-
Zhanibek Kozhirbayev, et. al.Zhanibek Kozhirbayev ... Muslima Karabalayeva
01 Sep 2017
01 Sep 2017

Bidirectional Quaternion Long Short-term Memory Recurrent Neural Networks for Speech Recognition
Titouan Parcollet ... Georges Linares
-
Titouan Parcollet, et. al.Titouan Parcollet ... Georges Linares
01 May 2019
01 May 2019

Performance prediction of fuel cells using long short‐term memory recurrent neural network
Lu Zheng ... Tao Zhang
International Journal of Energy Research | VOL. 45
Lu Zheng, et. al.Lu Zheng ... Tao Zhang
18 Jan 2021
International Journal of Energy Research | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Incorporating Pre-Training in Long Short-Term Memory Networks for Tweets Classification

Abstract

Talk to us

Similar Papers