Social platform is one of the most commonly used sites in the today’s world, and people from different places exchange information, express opinions, etc. Twitter is one of the most convenient platforms where Twitter data are frequently used for research and are one of the largest micro-blogging platforms. As there is an increase in number of spam accounts, there is a need to distinguish between genuine and fake user accounts. In Twitter, there is rise in the number of spam accounts, to spread malicious information among the users. So, to overcome the problem of spam in Twitter, an analysis of the opinions and the emotions of users in the form of tweets have been carried out. In this paper, we present a model which performs sentimental analysis on real-time data that are collected on a particular topic of interest for the users who recently posted on that topic. Using this model, we compute the sentiment score of each user with the content-based features to detect spam in Twitter. The proposed method makes use of a custom rule-based algorithm for detection of bots and compares it with few other algorithms like MLP, decision tree and random forest to see how efficient the model is in detection of spam accounts.
Read full abstract