A novel feature extraction methodology using Siamese convolutional neural networks for intrusion detection

Serafeim Moustakidis, Patrik Karlsson

doi:10.1186/s42400-020-00056-4

Abstract

Intrusion detection systems (IDS) can play a significant role in detecting security threats or malicious attacks that aim to steal information and/or corrupt network protocols. To deal with the dynamic and complex nature of cyber-attacks, advanced intelligent tools have been applied, resulting in powerful and automated IDS that rely on the latest advances in machine learning (ML) and deep learning (DL). Most of the reported effort has been devoted to building complex ML/DL architectures that adopt a brute-force approach to maximizing their detection capacity. However, only a limited number of studies have focused on identifying or extracting user-friendly risk indicators that security experts could easily use. Many papers have explored various dimensionality reduction algorithms; however, a large number of selected features is still required to detect attacks successfully, and humans cannot intuitively or immediately understand such feature sets. To enhance users’ trust in and understanding of the data without sacrificing accuracy, this paper contributes to the transformation of the available data collected by IDS into a single actionable and easy-to-understand risk indicator. To achieve this, a novel feature extraction pipeline was implemented consisting of the following components: (i) a fuzzy allocation scheme that transforms raw data to fuzzy class memberships, (ii) a novel modality transformation mechanism for converting feature vectors to images (Vec2im) and (iii) a dimensionality reduction module that uses Siamese convolutional neural networks to reduce the input data dimensionality to a 1-d feature space.
The performance of the proposed methodology was validated with respect to detection accuracy, dimensionality reduction performance and execution time on the NSL-KDD dataset via a thorough comparative analysis that demonstrated its effectiveness (86.64% testing accuracy using only one feature) over a number of well-known feature selection (FS) and extraction techniques. The output of the proposed feature extraction pipeline could be potentially used by security experts as an indicator of malicious activity, whereas the generated images could be further utilized and/or integrated as a visual analytics tool in existing IDS.

Highlights

An intrusion detection system (IDS) is a security tool that collects information from various sources, aiming to identify malicious activities and/or users that attempt to gain access to computers, steal protected data or even manipulate and disable information systems (Sharma and Gupta 2015).
We evaluated decision trees (Belson 1959; Witten et al 2011) driven by Gini’s diversity index, KNN (Atkeson et al 1997), and non-linear support vector machine (SVM) algorithms (Cortes and Vapnik 1995; Scholkopf 1997) with a Gaussian kernel, which can deal with the overfitting problems that appear in high-dimensional spaces.
A novel feature extraction pipeline is proposed in this paper that consists of the following components: (i) a fuzzy allocation scheme that transforms raw data to fuzzy class memberships, (ii) a mechanism for converting feature vectors to images (Vec2im) and (iii) a dimensionality reduction module that uses Siamese convolutional neural networks to reduce the input data dimensionality to a 1-d feature space.
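
The highlights mention decision trees driven by Gini’s diversity index. As a minimal sketch of what that criterion measures (not the authors' implementation), the index of a node is one minus the sum of squared class proportions, and a candidate split can be scored by the size-weighted index of its two children. The function names and the toy labels below are hypothetical and chosen only for illustration.

```python
from collections import Counter


def gini_diversity_index(labels):
    """Return 1 - sum(p_c^2) over the class proportions p_c of the labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def split_impurity(left, right):
    """Size-weighted Gini index of the two children of a candidate split."""
    n = len(left) + len(right)
    weighted = len(left) * gini_diversity_index(left)
    weighted += len(right) * gini_diversity_index(right)
    return weighted / n


# Toy labels (hypothetical): an evenly mixed node and a split that separates it.
mixed = ["normal", "attack", "normal", "attack"]
parent_gini = gini_diversity_index(mixed)
pure_split = split_impurity(["normal", "normal"], ["attack", "attack"])
```

A tree learner that uses this criterion would prefer the split with the lowest weighted index; here the split that separates the two classes scores lower than the mixed parent node.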

Summary

Introduction

An IDS is a security tool that collects information from various sources (e.g. routers, computers, network data), aiming to identify malicious activities and/or users that attempt to gain access to computers, steal protected data or even manipulate and disable information systems (Sharma and Gupta 2015). The first category of IDS compares the collected patterns of network traffic [...] evaluate and verify. Unlike signature and specification IDSs, automated intrusion detection (AID) systems are a new category that employs machine learning, statistical-based or knowledge-based methods to define a normal model of the behavior of a computer system. The effectiveness of AID systems depends heavily on both the quantity and the quality of the network traffic patterns that are used as data instances during their training. One simplistic method to decide whether a behavior is normal or abnormal is to compare it with the standard deviation of the normal user behaviors in the training dataset. Any example exceeding the predetermined threshold (e.g. three times the standard deviation) could be classified in the intrusion category. Development of ML-based AID systems comprises two phases: the training phase and the testing phase.
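
The simplistic threshold rule described above can be sketched as follows. This is an illustration under stated assumptions rather than the method evaluated in the paper: it assumes a single numeric feature, measures deviation from the mean of the normal training examples, and uses the three-standard-deviation threshold given as an example in the text. The function names and the sample values are hypothetical.

```python
from statistics import mean, stdev


def fit_normal_profile(normal_values):
    """Estimate the mean and sample standard deviation of normal-only data."""
    return mean(normal_values), stdev(normal_values)


def is_intrusion(value, profile, k=3.0):
    """Flag a value lying more than k standard deviations from the normal mean."""
    mu, sigma = profile
    return abs(value - mu) > k * sigma


# Hypothetical feature values observed for normal traffic during training.
normal_values = [10.0, 12.0, 11.0, 9.0, 10.5, 11.5, 9.5, 10.0]
profile = fit_normal_profile(normal_values)

# Testing phase: one value close to normal behavior, one far outside it.
flags = [is_intrusion(v, profile) for v in (10.2, 30.0)]
```

The two functions mirror the two phases named in the text: the profile is fitted once on training data, and the fixed threshold is then applied to unseen examples.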

Methods

Results

Conclusion

Journal: Cybersecurity	Publication Date: Aug 14, 2020
Citations: 18	License type: open-access

Similar Papers

Intelligent Differential Evolution based Feature Selection with Deep Neural Network for Intrusion Detection in Wireless Sensor Networks
Ibrahim M El El-Hasnony
Journal of Intelligent Systems and Internet of Things | VOL. -
01 Jan 2019

Neural Networks for Intrusion Detection
Danielagotseva ... Roumen Trifonov
01 Oct 2019

A novel hybrid automatic intrusion detection system using machine learning technique for anomalous detection based on traffic prediction
D Vinod ... M Prasad
05 Apr 2023

A Novel Approach for intrusion Detection in Heterogeneous Wireless Sensor Networks using multipath Routing
Suman Jyothula
IOSR Journal of Computer Engineering | VOL. 16
01 Jan 2014