Co-Detection of crowdturfing microblogs and spammers in online social networks

Bo Liu, Benyuan Liu, Xiangguo Sun, Zeyang Ni, Xinwen Fu, Junzhou Luo, Jiuxin Cao

doi:10.1007/s11280-019-00727-4

Abstract

The rise of online crowdsourcing services has driven an evolution from traditional spamming accounts, which spread unwanted advertisements and fraudulent content, to a new type of spammer whose behavior resembles that of normal users. Prior research has mainly studied machine accounts and spam separately, and the characteristics of these new spammers and their spam make it difficult for traditional methods to perform well. In this paper, we integrate the study of new-type spammers with the study of crowdturfing microblogs, investigating the crowdsourcing mechanism and the close relationship between crowdturfing spammers and microblogs in order to detect both more precisely. We propose a novel semi-supervised learning framework for co-detecting crowdturfing microblogs and spammers by jointly modeling user behavior, message content, and users’ following and retweeting networks. To meet the challenge of sparsely labeled datasets, we design a co-detection objective function that minimizes empirical error and allows the sparse labels to propagate to unlabeled samples. The advantage of this framework is threefold. First, through in-depth mining of new-type spammers, we aggregate a number of newly found features that distinguish them clearly from normal users. Second, by modeling both following networks and retweeting networks, we characterize the crowdsourcing mechanism that spammers abuse in crowdturfing microblog diffusion, which markedly increases detection performance. Third, through our semi-supervised objective function, we overcome the problem of label sparseness and can handle large, sparsely labeled data more reliably. Extensive experiments on real datasets demonstrate that our method outperforms four baselines on various metrics (Precision-Recall, AUC, Precision@K, and so on).
We also develop a robust system whose functions include data collection and availability analysis, spam and spammer detection, and visualization. To make our experiments replicable, we have made our dataset and code openly available at https://github.com/sunxiangguo/Crowdturfing.
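
The abstract does not spell out the co-detection objective, so the following is only a minimal sketch of the general idea it names: letting a few labels spread over a graph to unlabeled nodes. It uses a generic label-spreading iteration (symmetric normalization plus a fidelity term), which is an assumption and not the authors' formulation. The function name `propagate_labels`, the parameter `alpha`, and the six-node toy graph are all hypothetical.

```python
import math


def propagate_labels(adjacency, seed_labels, alpha=0.8, iterations=50):
    """Spread sparse seed labels over a graph (generic label spreading).

    adjacency   : symmetric list-of-lists of non-negative edge weights
    seed_labels : +1.0 (spam), -1.0 (normal), 0.0 (unlabeled), one per node
    alpha       : trade-off between neighbour smoothness and seed fidelity
    """
    n = len(adjacency)
    degree = [sum(row) for row in adjacency]
    # Symmetric normalisation S = D^{-1/2} W D^{-1/2}; isolated nodes get zeros.
    norm = [
        [
            adjacency[i][j] / math.sqrt(degree[i] * degree[j])
            if degree[i] > 0 and degree[j] > 0
            else 0.0
            for j in range(n)
        ]
        for i in range(n)
    ]
    scores = list(seed_labels)
    for _ in range(iterations):
        # Each node mixes its neighbours' scores with its own seed label.
        scores = [
            alpha * sum(norm[i][j] * scores[j] for j in range(n))
            + (1 - alpha) * seed_labels[i]
            for i in range(n)
        ]
    return scores


# Hypothetical toy graph: two triangles {0,1,2} and {3,4,5} joined by a weak edge 2-3.
adjacency = [
    [0.0, 1.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 0.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.1, 0.0, 0.0],
    [0.0, 0.0, 0.1, 0.0, 1.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 0.0, 1.0],
    [0.0, 0.0, 0.0, 1.0, 1.0, 0.0],
]
seeds = [1.0, 0.0, 0.0, 0.0, 0.0, -1.0]  # only nodes 0 and 5 are labeled

scores = propagate_labels(adjacency, seeds)
predicted_spam = [i for i, s in enumerate(scores) if s > 0]
print(predicted_spam)
```

In this toy setting the sign of each final score serves as the predicted class, so unlabeled nodes inherit the label of the seed they are most strongly connected to. A co-detection variant would presumably couple two such score vectors, one for users and one for microblogs, but how the paper does this is not stated in the abstract.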

Journal: World Wide Web
Publication Date: Oct 23, 2019
Citations: 19

Similar Papers

A hybrid Data-Driven framework for Spam detection in Online Social Network
Chanchal Kumar ... Shiv Prakash
Procedia Computer Science | VOL. 218
01 Jan 2023

Efficient spam detection across Online Social Networks
Hailu Xu ... Ahmad Javaid
01 Mar 2016

Under the Shadow of Sunshine
Srishti Gupta ... Payas Gupta
15 May 2018

Testing Assessment of Group Collaborations in OSNs
Izzat Alsmadi ... Mohammad Al-Abdullah
18 Sep 2018
