An Empirical Study for Detecting Fake Facebook Profiles Using Supervised Mining Techniques

Mohammed Basil Albayati,Ahmad Mousa Altamimi

doi:10.31449/inf.v43i1.2319

Abstract

Our social life and the way of people communicate are greatly affected by the social media technologies. The variety of stand-alone and built-in social media services such as Facebook, Twitter, LinkedIn, and alike facilitate users to create highly interactive platforms. However, these overwhelming technologies made us sank in an enormous amount of information. Recently, Facebook exposed data on 50 million Facebook unaware users for analytical purposes. Fake profiles are also used by Scammers to infiltrate networks of friends to wreak all sorts of havoc as stealing valuable information, financial fraud, or entering other user's social graph. In this paper, we turn our focus to Facebook fake profiles, and proposed a smart system (FBChecker) that enables users to check if any Facebook profile is fake. To achieve that, FBChecker utilizes the data mining approach to analyze and classify a set of behavioral and informational attributes provided in the personal profiles. Specifically, we empirically examine these attributes using four supervised data mining algorithms (e.g., k-NN, decision tree, SVM, and naive Bayes) to determine how successfully we can recognize the fake profiles. To demonstrate the validity of our conceptual work, the selected classifiers have been implemented using RapidMiner data science platform with a dataset of 200 profiles collected from the authors’ profile and a honeypot page. Two experiments are developed; in the first one, the k-NN schema is applied as an estimator model for imputation the missing data with substituted values, whereas in the second experiment a filtering operator is applied to exclude the profiles with missing values. Results showed high accuracy rate with the all classifiers, however, the SVM outperforms other classifiers with an accuracy rate of 98.0% followed by Naive Bayes.

Highlights

In recent years, social media technologies (e.g., Facebook, Twitter, LinkedIn, etc.) have become a vital part of our life [1]
Statistics and surveys for example the one that conducted by the American Academy of Pediatrics exhibit that about 84% of adolescents in America registered on Facebook social online site [5], showed that the average users spend more than two hours on social network and even more on social online sites such as Facebook, Twitter and else more than any other sites or platform [6]
We focused on the problem of detecting fake profiles in Facebook and presenting a smart detection system (FBChecker) to handle this problem based on the prediction and classification techniques of data mining

Summary

Introduction

Social media technologies (e.g., Facebook, Twitter, LinkedIn, etc.) have become a vital part of our life [1] They are designed and maintained by social media organizations presenting a portal for facilitating communication, interaction, sharing information, and entertainment via virtual communities and networks. Users typically utilize these services by creating their own profiles and connecting them with others’ profiles through various technologies that offer social media functionality [2]. Under the unsupervised methods, no labeled examples are provided and there is no notion of the output during the process, instead the data with similar attributes or similar behavior are grouped together (clustered) [22, 23]

Methods

Results

Conclusion