Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability

Guanjun Lin,Jun Zhang,Nan Sun,Yang Xiang,Houcine Hassan,Surya Nepal

doi:10.1109/access.2017.2710540

Abstract

With the trend that the Internet is becoming more accessible and our devices being more mobile, people are spending an increasing amount of time on social networks. However, due to the popularity of online social networks, cyber criminals are spamming on these platforms for potential victims. The spams lure users to external phishing sites or malware downloads, which has become a huge issue for online safety and undermined user experience. Nevertheless, the current solutions fail to detect Twitter spams precisely and effectively. In this paper, we compared the performance of a wide range of mainstream machine learning algorithms, aiming to identify the ones offering satisfactory detection performance and stability based on a large amount of ground truth data. With the goal of achieving real-time Twitter spam detection capability, we further evaluated the algorithms in terms of the scalability. The performance study evaluates the detection accuracy, the true/false positive rate and the F-measure; the stability examines how stable the algorithms perform using randomly selected training samples of different sizes. The scalability aims to better understand the impact of the parallel computing environment on the reduction of the training/testing time of machine learning algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2017
Citations: 35	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A framework for real-time spam detection in Twitter
Himank Gupta ... Maunendra Sankar Desarkar
-
Himank Gupta, et. al.Himank Gupta ... Maunendra Sankar Desarkar
01 Jan 2018
01 Jan 2018

Evaluating the Performance of Heterogeneous and Homogeneous Ensemble-based Models for Twitter Spam Classification
Akinyemi Moruff Oyelakin ... I S Olatinwo
Innovative Computing Review | VOL. 2
Akinyemi Moruff Oyelakin, et. al.Akinyemi Moruff Oyelakin ... I S Olatinwo
25 Dec 2022
Innovative Computing Review | VOL. 2

Empirical Evaluation and New Design for Fighting Evolving Twitter Spammers
Chao Yang ... Robert Harkreader
IEEE Transactions on Information Forensics and Security | VOL. 8
Chao Yang, et. al.Chao Yang ... Robert Harkreader
01 Aug 2013
IEEE Transactions on Information Forensics and Security | VOL. 8

A comparative study of the class imbalance problem in Twitter spam detection
Chaoliang Li ... Shigang Liu
Concurrency and Computation: Practice and Experience | VOL. 30
Chaoliang Li, et. al.Chaoliang Li ... Shigang Liu
17 Sep 2017
Concurrency and Computation: Practice and Experience | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Statistical Twitter Spam Detection Demystified: Performance, Stability and Scalability

Abstract

Talk to us

Similar Papers

More From: IEEE Access