Abstract

With the rapid advancements of ubiquitous information and communication technologies, a large number of trustworthy online systems and services have been deployed. However, cybersecurity threats are still mounting. An intrusion detection (ID) system can play a significant role in detecting such security threats. Thus, developing an intelligent and accurate ID system is a non-trivial research problem. Existing ID systems that are typically used in traditional network intrusion detection system often fail and cannot detect many known and new security threats, largely because those approaches are based on classical machine learning methods that provide less focus on accurate feature selection and classification. Consequently, many known signatures from the attack traffic remain unidentifiable and become latent. Furthermore, since a massive network infrastructure can produce large-scale data, these approaches often fail to handle them flexibly, hence are not scalable. To address these issues and improve the accuracy and scalability, we propose a scalable and hybrid IDS, which is based on Spark ML and the convolutional-LSTM (Conv-LSTM) network. This IDS is a two-stage ID system: the first stage employs the anomaly detection module, which is based on Spark ML. The second stage acts as a misuse detection module, which is based on the Conv-LSTM network, such that both global and local latent threat signatures can be addressed. Evaluations of several baseline models in the ISCX-UNB dataset show that our hybrid IDS can identify network misuses accurately in 97.29% of cases and outperforms state-of-the-art approaches during 10-fold cross-validation tests.

Highlights

  • Information and communication technologies impact every aspect of society and people’s lives, so attacks on ICT systems are increasing

  • This imposes a great challenge to existing intrusion detection system (IDS) systems because these because these approaches are unscalable and often inefficient. To address these issues approaches are unscalable and often inefficient. To address these issues and improve the and improve the accuracy and scalability, we propose a scalable and hybrid IDS, which is based on accuracy and scalability, we propose a scalable and hybrid IDS, which is based on Spark machine learning (ML) and the

  • It is the last module of the proposed hybrid intrusion detection (ID) architecture that reports the ID activity

Read more

Summary

Introduction

Information and communication technologies impact every aspect of society and people’s lives, so attacks on ICT systems are increasing. There are three main categories of IDS according to dynamic detection methods; the first is the misuse detection technique, which is known as a signature-based system (SBS). The misuse attack detection technique achieves maximum accuracy and minimum false alarm rate, but it cannot detect unknown attacks, while the behavior-based system is known as ABS and detects an attack by comparing abnormal behavior to normal behavior. Anomaly and misuse intrusion detection techniques have their limitations, but in our hybrid approach we combine the two techniques to overcome their disadvantages and propose a novel classical technique joining the benefits of the two techniques to achieve improved performance over traditional methods. We propose an improved version of IDS, which is based on Spark ML and the Conv-LSTM network.

Related Work
Materials and Methods
Architecture
Datasets
Feature Engineering and Data Preparation
Implementation Details
Stage 1
Stage 2
The Alarm Module
Results
Performance processor and 32Metrics
Performance Metrics
Evaluation of the IDS System
Overall Analysis
Conclusions and Outlook
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call