Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data

Nathan Martindale,Douglas A Talbert,Muhammad Ismail

doi:10.3390/info11060315

Nathan Martindale, Douglas A Talbert + Show 1 more

Open Access

https://doi.org/10.3390/info11060315

Copy DOI

Journal: Information	Publication Date: Jun 11, 2020
Citations: 25	License type: CC BY 4.0

Affiliation: Tennessee Technological University

Abstract

As new cyberattacks are launched against systems and networks on a daily basis, the ability for network intrusion detection systems to operate efficiently in the big data era has become critically important, particularly as more low-power Internet-of-Things (IoT) devices enter the market. This has motivated research in applying machine learning algorithms that can operate on streams of data, trained online or “live” on only a small amount of data kept in memory at a time, as opposed to the more classical approaches that are trained solely offline on all of the data at once. In this context, one important concept from machine learning for improving detection performance is the idea of “ensembles”, where a collection of machine learning algorithms are combined to compensate for their individual limitations and produce an overall superior algorithm. Unfortunately, existing research lacks proper performance comparison between homogeneous and heterogeneous online ensembles. Hence, this paper investigates several homogeneous and heterogeneous ensembles, proposes three novel online heterogeneous ensembles for intrusion detection, and compares their performance accuracy, run-time complexity, and response to concept drifts. Out of the proposed novel online ensembles, the heterogeneous ensemble consisting of an adaptive random forest of Hoeffding Trees combined with a Hoeffding Adaptive Tree performed the best, by dealing with concept drift in the most effective way. While this scheme is less accurate than a larger size adaptive random forest, it offered a marginally better run-time, which is beneficial for online training.

Highlights

In an increasingly technological society, our reliance on networks of systems has dramatically exploded
We propose three heterogeneous ensembles based on Hoeffding Adaptive Tree (HAT) + Adaptive Random Forest (ARF), Support Vector Machine (SVM) + HAT, and SVM + ARF
We show that the heterogeneous ensemble of HAT + ARF handles concept drift better than the other detectors

Summary

Introduction

In an increasingly technological society, our reliance on networks of systems has dramatically exploded. This manifests itself in the vast adoption of Internet-of-Things (IoTs). Given how much data flows through these networks on a regular basis, manual analysis of traffic logs is inadequate for detecting malicious intrusions into the network. This creates an incentive to develop better automated systems for analyzing this traffic to judge whether it is benign or malicious. These automated systems are known as Network Intrusion

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information

Lead the way for us

Similar Papers

An Adaptive Heterogeneous Online Learning Ensemble Classifier for Nonstationary Environments.
Tinofirei Museba ... Fulufhelo Nelwamondo
Computational Intelligence and Neuroscience | VOL. 2021
Tinofirei Museba, et. al.Tinofirei Museba ... Fulufhelo Nelwamondo
01 Jan 2020
Computational Intelligence and Neuroscience | VOL. 2021

Building heterogeneous ensembles by pooling homogeneous ensembles
Maryam Sabzevari ... Gonzalo Martínez-Muñoz
International Journal of Machine Learning and Cybernetics | VOL. 13
Maryam Sabzevari, et. al.Maryam Sabzevari ... Gonzalo Martínez-Muñoz
13 Oct 2021
International Journal of Machine Learning and Cybernetics | VOL. 13

Cardiac surgery risk prediction using ensemble machine learning to incorporate legacy risk scores: A benchmarking study.
Tim Dong ... Gianni D Angelini
DIGITAL HEALTH | VOL. 9
Tim Dong, et. al.Tim Dong ... Gianni D Angelini
01 Jan 2023
DIGITAL HEALTH | VOL. 9

Generalizability evaluation of heterogeneous ensembles models for benthic macroinvertebrate index predictions
Taeseung Park ... Yoonkyung Cha
-
Taeseung Park, et. al.Taeseung Park ... Yoonkyung Cha
08 Mar 2024
08 Mar 2024

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ensemble-Based Online Machine Learning Algorithms for Network Intrusion Detection Systems Using Streaming Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Information