Performance Examination and Feature Selection on Sybil User Data using Recursive Feature Elimination

doi:10.35940/ijitee.i1108.0789s419

Abstract

Machine Learning (ML) research greatly helps in predicting model-based outcomes with high levels of accuracy based upon the training and testing of the models through the datasets. The social networks constitute one of the domains where ML can be used effectively to ensure the authenticity and security of the valid users. With the increase in usage of Online Social Networks (OSNs), the cases of spam and malicious activities can be found in abundance and Sybil nodes pose one such kind of safety and security hazard. Sybil account detection is not an easy task since they mimic the actual behavior of human accounts up to a great extent. In this paper, we look at one such scenario of Sybil accounts on the OSN, Twitter where machine leaning models have been used to train the machine with the existing datasets so as to be able to detect these malicious users before they can bring harm to the normal communication of the genuine users. Since the datasets used are so vast, the process of feature selection has been carried on the datasets as part of pre-processing before the actual classification as it assists in enhancing the model performance. Support Vector Machine–Recursive Feature Elimination (SVM-RFE) and Logistic Regression–Recursive Feature Elimination (LR-RFE) techniques have been used in this study for the selection of significant features. The classification model is trained on the selected features using Random Forest (RF) and K-Nearest Neighbor (KNN) algorithms. We also analyzed the biasing effects of fake accounts on the human accounts datasets during the process of features selection and classification. It has been shown that the RF algorithm outperformed KNN on the feature sets selected through SVM-RFE and LR-RFE.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance Examination and Feature Selection on Sybil User Data using Recursive Feature Elimination

Abstract

Talk to us

Similar Papers

More From: International Journal of Innovative Technology and Exploring Engineering

Lead the way for us

Journal: International Journal of Innovative Technology and Exploring Engineering	Publication Date: Oct 1, 2019
Citations: 2

Similar Papers

Feature Selection Optimization Using Genetic Algorithm for Spambot Detection in an OSN
Arjun Singh ... Manu Sood
-
Arjun Singh, et. al.Arjun Singh ... Manu Sood
01 Jan 2020
01 Jan 2020

Kernel PCA and SVM-RFE based feature selection for classification of dengue microarray dataset
Elke Annisa Octaria ... Alhadi Bustamam
-
Elke Annisa Octaria, et. al.Elke Annisa Octaria ... Alhadi Bustamam
01 Jan 2020
01 Jan 2020

Development of Two-Stage SVM-RFE Gene Selection Strategy for Microarray Expression Data Analysis
Yuchun Tang ... Zhen Huang
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 4
Yuchun Tang, et. al.Yuchun Tang ... Zhen Huang
01 Jul 2007
IEEE/ACM Transactions on Computational Biology and Bioinformatics | VOL. 4

Feature selection and analysis on correlated gas sensor data with recursive feature elimination
Ke Yan ... David Zhang
Sensors and Actuators B: Chemical | VOL. 212
Ke Yan, et. al.Ke Yan ... David Zhang
16 Feb 2015
Sensors and Actuators B: Chemical | VOL. 212

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Examination and Feature Selection on Sybil User Data using Recursive Feature Elimination

Abstract

Talk to us

Similar Papers

More From: International Journal of Innovative Technology and Exploring Engineering