
In the past few years, social media platforms such as Facebook, Twitter, and Sina Weibo have become important channels for information dissemination and communication. At the same time, these platforms are vulnerable to spammers, who typically spread objectionable content such as phishing URLs, false news, and even pornography to other users. As the number of social media spammers grows rapidly, traditional spammer detection methods have become less effective. In this paper, we present a novel semi-supervised social media spammer detection approach that makes full use of message content, user behavior, and social relation information. First, we adapt the constrained NMF-based semi-supervised learning (CNMF) algorithm, which builds on nonnegative matrix factorization (NMF), by imposing a label information constraint and a sparseness constraint. Second, we present a novel CNMF-based integral framework for social media spammer detection that performs collaborative factorization on the message content matrix and on the user behavior and social relation information matrix. Moreover, we derive the iterative update rule (IUR) and the optimization algorithm for the spammer detection model, and we prove the convergence of the resulting procedure. Extensive experiments are conducted on a real-world dataset from Sina Weibo. The experimental results demonstrate that our proposed model performs significantly better than conventionally applied supervised classifiers for spammer detection.
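
For readers unfamiliar with the building block named above, the following is a minimal sketch of plain NMF with the standard Lee-Seung multiplicative update rule for the Frobenius objective. It is not the CNMF model of this paper: it omits the label information constraint, the sparseness constraint, and the collaborative factorization across two matrices. The function name nmf_multiplicative, its parameters, and the synthetic matrix X are all assumptions made for illustration.

```python
import numpy as np


def nmf_multiplicative(X, rank, n_iter=200, eps=1e-9, seed=0):
    """Plain NMF: approximate nonnegative X (n x m) by W @ H.

    Hypothetical helper for illustration only; it uses the standard
    Lee-Seung multiplicative updates, not the paper's CNMF update rule.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    # Strictly positive random initialization keeps the factors nonnegative.
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    errors = []
    for _ in range(n_iter):
        # Each update multiplies by a nonnegative ratio, so signs never flip.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
        # Track the Frobenius reconstruction error after each iteration.
        errors.append(np.linalg.norm(X - W @ H, "fro"))
    return W, H, errors


# Synthetic nonnegative matrix of rank at most 3 (assumed data, not Sina Weibo).
data_rng = np.random.default_rng(42)
X = data_rng.random((20, 3)) @ data_rng.random((3, 12))

W, H, errors = nmf_multiplicative(X, rank=3)
print(f"first-iteration error: {errors[0]:.4f}, final error: {errors[-1]:.4f}")
```

In a semi-supervised variant such as the one the abstract describes, the known labels would further restrict one of the factors; how exactly that is done is specified in the full paper and is not reproduced here.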
