A Hybrid Intrusion Detection System Based on Scalable K-Means+ Random Forest and Deep Learning

Chao Liu,Jialiang Wang,Zhaojun Gu

doi:10.1109/access.2021.3082147

Abstract

Digital assets have come under various network security threats in the digital age. As a kind of security equipment to protect digital assets, intrusion detection system (IDS) is less efficient if the alert is not timely and IDS is useless if the accuracy cannot meet the requirements. Therefore, an intrusion detection model that combines machine learning with deep learning is proposed in this paper. The model uses the k-means and the random forest (RF) algorithms for the binary classification, and distributed computing of these algorithms is implemented on the Spark platform to quickly classify normal events and attack events. Then, by using the convolutional neural network (CNN), long short-term memory (LSTM), and other deep learning algorithms, the events judged as abnormal are further classified into different attack types finally. At this stage, adaptive synthetic sampling (ADASYN) is adopted to solve the unbalanced dataset. The NSL-KDD and CIS-IDS2017 datasets are used to evaluate the performance of the proposed model. The experimental results show that the proposed model has better TPR for most of attack events, faster data preprocessing speed, and potentially less training time. In particular, the accuracy of multi-target classification can reach as high as 85.24% in the NSL-KDD dataset and 99.91% in the CIC-IDS2017 dataset.

Highlights

The world is moving towards digitization, networking, and intelligence
PROPOSED INTRUSION DETECTION MODEL Based on the ideas above, this paper considers using deep learning algorithms combined with machine learning algorithms to classify intrusion events
TPR is the ratio of samples labeled as attacks that are correctly predicted to be attacked in test sets to all incidents labeled as attacks

Summary

INTRODUCTION

The world is moving towards digitization, networking, and intelligence. The vigorous development of the internet has accelerated the flow of data assets. Liu et al.: Hybrid IDS Based on Scalable K-Means+ RF and Deep Learning neurons, can tap hidden intrusion features Algorithms such as DNN [7], Convolutional Neural Network (CNN) [8], and Long Short-Term Memory (LSTM) [9], take too much time to train the model. Souza et al [19] proposed a DNN-KNN hybrid binary classification method In addition to these studies, there are combinations of distributed machine learning algorithms and deep learning algorithms. Ensuring the speed of intrusion detection without affecting the accuracy is an important problem that needs to be considered for future IDS models Based on these studies, this paper considers improving the model so that different attack events can be classified more quickly making the model more useful.

INTRUSION DETECTION FRAMEWORK AND KEY TECHNOLOGIES

DATASET DESCRIPTION AND PREPROCESSING

BINARY CLASSIFICATION STAGE BASED ON DISTRIBUTED MACHINE LEARNING

INTRUSION DETECTION EXPERIMENTAL DESIGN

EVALUATING INDICATOR

RESULTS OF THE BINARY CLASSIFICATION STAGE

Findings

CONCLUSION AND FUTURE WORK

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 83	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Hybrid Intrusion Detection System Based on Scalable K-Means+ Random Forest and Deep Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

An improved binary manta ray foraging optimization algorithm based feature selection and random forest classifier for network intrusion detection
Ibrahim Hayatu Hassan ... Sahabi Ali Yusuf
Intelligent Systems with Applications | VOL. 16
Ibrahim Hayatu Hassan, et. al.Ibrahim Hayatu Hassan ... Sahabi Ali Yusuf
01 Nov 2022
Intelligent Systems with Applications | VOL. 16

Intrusion Detection System for Industrial Internet of Things Based on Deep Reinforcement Learning
Sumegh Tharewal ... Mohammad Shabaz
Wireless Communications and Mobile Computing | VOL. 2022
Sumegh Tharewal, et. al.Sumegh Tharewal ... Mohammad Shabaz
07 Mar 2022
Wireless Communications and Mobile Computing | VOL. 2022

Packet Preprocessing in CNN-Based Network Intrusion Detection System
Wooyeon Jo ... Changhoon Lee
Electronics | VOL. 9
Wooyeon Jo, et. al.Wooyeon Jo ... Changhoon Lee
16 Jul 2020
Electronics | VOL. 9

Network Intrusion Detection Model Based on CNN and GRU
Bo Cao ... Chen Chen
Applied Sciences | VOL. 12
Bo Cao, et. al.Bo Cao ... Chen Chen
21 Apr 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Hybrid Intrusion Detection System Based on Scalable K-Means+ Random Forest and Deep Learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access