Machine Learning Algorithms for Raw and Unbalanced Intrusion Detection Data in a Multi-Class Classification Problem

Mantas Bacevicius, Agne Paulauskaite-Taraseviciene

doi:10.3390/app13127328

Open Access

https://doi.org/10.3390/app13127328

Journal: Applied Sciences	Publication Date: Jun 20, 2023
Citations: 6	License type: CC BY 4.0

Affiliation: University of Technology

Abstract

Various machine learning algorithms have been applied to network intrusion classification problems, including both binary and multi-class classification. Although numerous studies use unbalanced network intrusion datasets such as CIC-IDS2017, a prevalent approach is to sidestep the imbalance by either merging classes to reduce their number or retaining only the most dominant ones. However, there is no consistent trend showing that accuracy always decreases as the number of classes increases. Furthermore, cybersecurity practitioners need to recognize the specific type of attack and understand the factors that drive a model's output. This study addresses the challenges of evaluating multi-class classification of network intrusions on highly imbalanced raw data drawn from the CIC-IDS2017 and CSE-CIC-IDS2018 datasets. The research investigates diverse machine learning (ML) models, including Logistic Regression, Random Forest, Decision Trees, CNNs, and Artificial Neural Networks. Additionally, it explores the use of explainable AI (XAI) methods to interpret the results. The results indicate that decision trees using the CART algorithm performed best on the 28-class classification task, with an average macro F1-score of 0.96878.
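
The abstract reports a macro F1-score rather than plain accuracy. The following is a minimal sketch, not the authors' evaluation code, of why that choice matters on imbalanced data: macro F1 averages the per-class F1 values with equal weight, so rare attack classes count as much as the dominant one. The labels, class counts, and helper names (`per_class_f1`, `macro_f1`, `accuracy`) are made up for illustration and are not taken from the paper.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} for every label seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    scores = {}
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical, highly imbalanced toy labels (not real dataset counts).
y_true = ["Benign"] * 95 + ["DoS"] * 3 + ["Bot"] * 2
# A degenerate classifier that always predicts the majority class.
y_pred = ["Benign"] * 100

acc = accuracy(y_true, y_pred)
mf1 = macro_f1(y_true, y_pred)
print(f"accuracy = {acc:.3f}, macro F1 = {mf1:.3f}")
```

In this toy case the always-"Benign" classifier reaches high accuracy while its macro F1 stays low, because the two minority classes each contribute an F1 of zero to the average.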

Similar Papers

Electrocardiogram analysis using a combination of statistical, geometric, and nonlinear heart rate variability features
Alan Jovic ... Nikola Bogunovic
Artificial Intelligence in Medicine | VOL. 51
25 Oct 2010

The severity prediction of the binary and multi-class cardiovascular disease − A machine learning-based fusion approach
Hafsa Binte Kibria ... Abdul Matin
Computational Biology and Chemistry | VOL. 98
31 Mar 2022

A combinatorial optimization approach for multi-label associative classification
Yuchun Zou ... Chun-An Chou
Knowledge-Based Systems | VOL. 240
31 Dec 2021

An Experimental Analysis of Attack Classification Using Machine Learning in IoT Networks.
Andrew Churcher ... Rehmat Ullah
Sensors | VOL. 21
10 Jan 2021
