Abstract

Emoji are pictographic characters used on social media to express the emotion of a text message. Despite the increasing use of emoji, few studies have examined the relationship between emoji and text. Because emoji are diverse and many have similar meanings, emoji classification is relatively more complex than common text classification tasks. In this paper, we build a computational model by extracting several kinds of features, namely linguistic, semantic, and lexicon features, to improve emoji classification performance. We then train on 400k tweets using two different classifiers, a Stochastic Gradient Descent classifier and Logistic Regression. The experiments show that our proposed features, used with Logistic Regression, outperform the baseline.
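
As a rough illustration of the general idea (combining feature groups and fitting a linear classifier), the following is a minimal, dependency-free sketch. It is not the authors' pipeline. The toy tweets, the label names, the `POSITIVE`/`NEGATIVE` lexicon, and all function names are hypothetical, and semantic features are not modeled. For brevity it fits a multinomial logistic regression with plain stochastic gradient descent, which merges the two classifier families named above into one model; in practice a library such as scikit-learn (`LogisticRegression`, `SGDClassifier`) would more likely be used.

```python
import math
import random
from collections import defaultdict

# Hypothetical toy lexicon; the abstract does not specify the actual lexicon.
POSITIVE = {"love", "great", "happy"}
NEGATIVE = {"sad", "hate", "awful"}

# Hypothetical toy data: (tweet text, emoji label name).
TRAIN = [
    ("i love this so much", "heart"),
    ("love you forever", "heart"),
    ("this is so sad", "cry"),
    ("i hate awful mondays", "cry"),
]


def extract_features(text):
    """Map a tweet to a sparse feature dict: unigram counts plus lexicon counts."""
    tokens = text.lower().split()
    feats = defaultdict(float)
    feats["bias"] = 1.0
    for tok in tokens:
        feats["word=" + tok] += 1.0  # simple linguistic (unigram) feature
    feats["lex_pos"] = float(sum(t in POSITIVE for t in tokens))  # lexicon feature
    feats["lex_neg"] = float(sum(t in NEGATIVE for t in tokens))  # lexicon feature
    return dict(feats)


def class_probs(weights, feats, labels):
    """Softmax over per-label linear scores."""
    scores = {
        y: sum(weights[y].get(f, 0.0) * v for f, v in feats.items())
        for y in labels
    }
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {y: math.exp(s - top) for y, s in scores.items()}
    total = sum(exps.values())
    return {y: e / total for y, e in exps.items()}


def train(data, epochs=200, lr=0.1, seed=0):
    """Fit multinomial logistic regression with per-example SGD updates."""
    labels = sorted({y for _, y in data})
    weights = {y: {} for y in labels}
    examples = [(extract_features(text), y) for text, y in data]
    rng = random.Random(seed)
    for _ in range(epochs):
        rng.shuffle(examples)
        for feats, gold in examples:
            probs = class_probs(weights, feats, labels)
            for y in labels:
                # Gradient of the log loss with respect to the score of label y.
                grad = probs[y] - (1.0 if y == gold else 0.0)
                for f, v in feats.items():
                    weights[y][f] = weights[y].get(f, 0.0) - lr * grad * v
    return weights, labels


def predict(model, text):
    """Return the most probable label for a tweet."""
    weights, labels = model
    probs = class_probs(weights, extract_features(text), labels)
    return max(labels, key=lambda y: probs[y])


if __name__ == "__main__":
    model = train(TRAIN)
    print(predict(model, "i love this so much"))
```

Keeping each feature group under its own key prefix (`word=`, `lex_`) makes it straightforward to switch a group off and compare results, which is the kind of feature comparison the abstract describes.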

