Adaptable Reduced-Complexity Approach Based on State Vector Machine for Identification of Criminal Activists on Social Media

Imran Shafi,Sadia Din,Zahid Hussain,Imran Ashraf,Gyu Sang Choi

doi:10.1109/access.2021.3094532

Imran Shafi, Sadia Din + Show 3 more

Open Access

https://doi.org/10.1109/access.2021.3094532

Copy DOI

Abstract

Security agencies face an emerging challenge of identifying and counter the malicious contents spread on the social media by the terrorists. However, text classification techniques are limited by visualization, pre-processing, features extraction, and larger features space. Additionally, change in criminal content require the learning models to identify altered malicious textual contents which poses extra challenge. This study proposes simplified yet adaptable framework that uses a novel features extraction algorithm for extracting features from the textual part of social media contents. The feature extraction considers selective features from only 8 dimensions and follows a six step process. The extracted features are suitably used to train the state vector machine for the classification of the malicious content. The performance of the proposed method is evaluated against other popular feature selection/extraction algorithms like term frequency-inverse document frequency, Gini Index (GI), Chi square statistics, and PCA. Additionally, machine learning classifiers like decision tree, random forest, and Naïve Bayes are also used for classification. Results suggest that the proposed approach consumes less energy on text visualization, pre-processing, and dimensionality reduction. It also reduces the time-space complexity of the features extraction process and is capable to steer according to the changing strategies of the active criminal groups. In addition, it can effectively analyze the propaganda material published by the extremists. It automatically identifies the radical text on social media platforms allowing understanding of the behaviors, characteristics and subsequent blockage of such content.

Highlights

T HE presence of criminals and terrorists in the online world is not new, but the current trend of social media enhanced their interest in the online community
The mentioned classifiers Decision Tree (DT), Random Forest (RF), SVM, and Naïve Bayes (NB) are trained and tested on the extracted features according to the said structure
1) Performance Improvement The results show that classifiers’ performance is more stable and improved with the features extraction algorithm (FEA) used for features extraction instead of using the combination of feature selection and feature extraction techniques

Summary

Introduction

T HE presence of criminals and terrorists in the online world is not new, but the current trend of social media enhanced their interest in the online community. Imran Shafi et al.: Adaptable Reduced-Complexity Approach Based on State Vector Machine for Identification of Criminal Activists on Social Media past from different parts of the world in which innocent people are provoked and recruited for criminal deeds by different terrorist groups through their malicious contents present on social media. Studies show that such criminal groups are found across the globe and have the capability to create unrest in society through their malicious campaign on social media.

Methods

Results

Conclusion