An Insider Data Leakage Detection Using One-Hot Encoding, Synthetic Minority Oversampling and Machine Learning Techniques.

Taher Al-Shehari, Rakan A Alsowail

doi:10.3390/e23101258

Abstract

Insider threats are malicious acts carried out by an authorized employee within an organization. They represent a major cybersecurity challenge for private and public organizations, as an insider attack can cause far more damage to an organization's assets than an external attack. Most existing approaches in the field of insider threat focus on detecting general insider attack scenarios. However, insider attacks can be carried out in different ways, and the most dangerous is a data leakage attack executed by a malicious insider before leaving an organization. This paper proposes a machine learning-based model for detecting such serious insider threat incidents. The proposed model addresses the possible bias in detection results that can arise from an inappropriate encoding process by employing feature scaling and one-hot encoding techniques. Furthermore, the class imbalance of the utilized dataset is addressed using the synthetic minority oversampling technique (SMOTE). Well-known machine learning algorithms are employed to identify the most accurate classifier for detecting data leakage events executed by malicious insiders during the sensitive period before they leave an organization. We provide a proof of concept for our model by applying it to the CMU-CERT Insider Threat Dataset and comparing its performance with the ground truth. The experimental results show that our model detects insider data leakage events with an AUC-ROC value of 0.99, outperforming existing approaches validated on the same dataset. The proposed model provides effective methods to address possible bias and class imbalance issues, with the aim of devising an effective insider data leakage detection system.
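The preprocessing steps named in the abstract (one-hot encoding, feature scaling, and SMOTE) can be illustrated with a minimal, standard-library-only sketch. This is not the authors' implementation: the toy features (`activity`, `size_kb`), the labels, the choice of min-max scaling, and the helper names `one_hot`, `min_max_scale`, and `smote` are all assumptions made for illustration. In practice one would more likely use an established library such as scikit-learn or imbalanced-learn.

```python
import random

def one_hot(values):
    """Encode each categorical value as a 0/1 indicator vector, one slot per category."""
    categories = sorted(set(values))
    index = {c: i for i, c in enumerate(categories)}
    rows = [[1.0 if index[v] == i else 0.0 for i in range(len(categories))]
            for v in values]
    return rows, categories

def min_max_scale(column):
    """Rescale a numeric column to [0, 1] (assumed scaling; a constant column maps to zeros)."""
    lo, hi = min(column), max(column)
    span = hi - lo
    return [(x - lo) / span if span else 0.0 for x in column]

def smote(minority, n_new, k=2, seed=0):
    """SMOTE-style oversampling: each synthetic point lies on the segment between
    a minority sample and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        others = [p for p in minority if p is not base]
        # Rank the remaining minority samples by squared Euclidean distance to base.
        others.sort(key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic

# Hypothetical toy data: 5 normal events (label 0) and 3 leakage events (label 1).
activity = ["email", "http", "usb", "http", "email", "usb", "http", "usb"]
size_kb = [12.0, 300.0, 4500.0, 80.0, 25.0, 5200.0, 150.0, 3900.0]
label = [0, 0, 1, 0, 0, 1, 0, 1]

encoded, categories = one_hot(activity)
scaled = min_max_scale(size_kb)
X = [row + [s] for row, s in zip(encoded, scaled)]

# Oversample the minority class until both classes have the same count.
minority = [x for x, y in zip(X, label) if y == 1]
n_majority = label.count(0)
synthetic = smote(minority, n_majority - len(minority))
X_bal = X + synthetic
y_bal = label + [1] * len(synthetic)
```

One-hot encoding avoids imposing an artificial order on categories (as integer codes would), and scaling keeps a large-valued feature from dominating distance computations, which matters here because SMOTE selects neighbours by distance. The balanced `X_bal` and `y_bal` would then be passed to whichever classifier is being evaluated.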

Journal: Entropy	Publication Date: Sep 27, 2021
Citations: 81	License type: CC BY 4.0

Similar Papers

Optimal weighted fusion based insider data leakage detection and classification model for Ubiquitous computing systems
Eatedal Alabdulkreem ... Romany F Mansour
Sustainable Energy Technologies and Assessments | VOL. 54
06 Oct 2022

Recommender System for Geo-Social Access Control Framework
International Journal of Innovative Technology and Exploring Engineering | VOL. 9
31 Dec 2020

A confirmatory analysis of the prevention insider threat in organization information system
Rahimah Abu Bakar ... Omar Dheyab
Journal of Technology and Humanities | VOL. 2
19 May 2021

Insider Threat Risk Prediction based on Bayesian Network
Nebrase Elmrabit ... Huiyu Zhou
Computers & Security | VOL. 96
30 May 2020
