An Analysis of the KDD99 and UNSW-NB15 Datasets for the Intrusion Detection System

Muataz Salam Al-Daweri,Khairul Akram Zainol Ariffin,Salwani Abdullah,Mohamad Firham Efendy Md Senan

doi:10.3390/sym12101666

Muataz Salam Al-Daweri, Khairul Akram Zainol Ariffin + Show 2 more

Open Access

https://doi.org/10.3390/sym12101666

Copy DOI

Abstract

The significant increase in technology development over the internet makes network security a crucial issue. An intrusion detection system (IDS) shall be introduced to protect the networks from various attacks. Even with the increased amount of works in the IDS research, there is a lack of studies that analyze the available IDS datasets. Therefore, this study presents a comprehensive analysis of the relevance of the features in the KDD99 and UNSW-NB15 datasets. Three methods were employed: a rough-set theory (RST), a back-propagation neural network (BPNN), and a discrete variant of the cuttlefish algorithm (D-CFA). First, the dependency ratio between the features and the classes was calculated, using the RST. Second, each feature in the datasets became an input for the BPNN, to measure their ability for a classification task concerning each class. Third, a feature-selection process was carried out over multiple runs, to indicate the frequency of the selection of each feature. From the result, it indicated that some features in the KDD99 dataset could be used to achieve a classification accuracy above 84%. Moreover, a few features in both datasets were found to give a high contribution to increasing the classification’s performance. These features were present in a combination of features that resulted in high accuracy; the features were also frequently selected during the feature selection process. The findings of this study are anticipated to help the cybersecurity academics in creating a lightweight and accurate IDS model with a smaller number of features for the developing technologies.

Highlights

Due to the increasing demand for computer networks and network technologies, the attack incidents are growing day by day, making the intrusion detection system (IDS) an essential tool to use for keeping the networks secure
It has been proven to be effective against many different attacks, such as the denial of service (DoS), structured query language (SQL) injection, and brute-force [1,2,3]
The discrete variant of the cuttlefish algorithm (D-cuttlefish algorithm (CFA)) was used for feature selection to select the most relevant features over multiple iterations and runs to indicate the most selection to select the most relevant features over multiple iterations and runs to indicate the most frequently selected features

Summary

Introduction

Due to the increasing demand for computer networks and network technologies, the attack incidents are growing day by day, making the intrusion detection system (IDS) an essential tool to use for keeping the networks secure. Two approaches are to be considered when developing an IDS [4]: misuse-based and anomaly-based. In the misuse-based approach, the IDS attempts to match the patterns of already known network attacks. Its database gets updated continuously by storing the patterns of known network attacks. The anomaly-based IDS, on the other hand, attempts to detect unknown network attacks by comparing them to the regular connection patterns. The anomaly-based IDSs are considered to be adaptive, and they are susceptible to generate a high number of false positives [4,5]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Symmetry	Publication Date: Oct 13, 2020
Citations: 59	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Analysis of the KDD99 and UNSW-NB15 Datasets for the Intrusion Detection System

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Symmetry

Lead the way for us

Similar Papers

A comprehensive guide to CAN IDS data and introduction of the ROAD dataset.
Miki E Verma ... Robert A Bridges
PloS one | VOL. 19
Miki E Verma, et. al.Miki E Verma ... Robert A Bridges
22 Jan 2024
PloS one | VOL. 19

An adaptive method and a new dataset, UKM-IDS20, for the network intrusion detection system
Muataz Salam Al-Daweri ... Khairul Akram Zainol Ariffin
Computer Communications | VOL. 180
Muataz Salam Al-Daweri, et. al.Muataz Salam Al-Daweri ... Khairul Akram Zainol Ariffin
10 Sep 2021
Computer Communications | VOL. 180

A novel GPU based intrusion detection system using deep autoencoder with Fruitfly optimization
R Sekhar ... K Sasirekha
SN Applied Sciences | VOL. 3
R Sekhar, et. al.R Sekhar ... K Sasirekha
01 May 2021
SN Applied Sciences | VOL. 3

Toward lightweight intrusion detection systems using the optimal and efficient feature pairs of the Bot-IoT 2018 dataset
Erman Özer ... Jahongir Azimjonov
International Journal of Distributed Sensor Networks | VOL. 17
Erman Özer, et. al.Erman Özer ... Jahongir Azimjonov
01 Oct 2021
International Journal of Distributed Sensor Networks | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Analysis of the KDD99 and UNSW-NB15 Datasets for the Intrusion Detection System

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Symmetry