How to Effectively Collect and Process Network Data for Intrusion Detection?

Mikołaj Komisarek, Michał Choraś, Witold Hołubowicz, Marek Pawlicki, Rafał Kozik

doi:10.3390/e23111532

Abstract

The number of security breaches in cyberspace is on the rise. This threat is met with intensive work in the intrusion detection research community. To keep the defensive mechanisms up to date and relevant, realistic network traffic datasets are needed. The use of flow-based data for machine-learning-based network intrusion detection is a promising direction for intrusion detection systems. However, many contemporary benchmark datasets do not contain features that are usable in the wild. The main contribution of this work is to cover the research gap related to identifying and investigating valuable features in the NetFlow schema that allow for effective, machine-learning-based network intrusion detection in the real world. To achieve this goal, several feature selection techniques have been applied to five flow-based network intrusion detection datasets, establishing an informative flow-based feature set. The authors' experience with deploying this kind of system shows that, to close the research-to-market gap and apply machine-learning-based intrusion detection in the real world, a set of labeled data from the end user has to be collected. This research aims at establishing the minimal amount of data that is sufficient to effectively train machine learning algorithms for intrusion detection. The results show that a set of 10 features and a small amount of data is enough for the final model to perform very well.
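The feature selection step described in the abstract can be pictured with a small filter-style sketch. The abstract does not name the selection techniques the authors used, so the example below swaps in one common choice, ranking features by their mutual information with the attack label, and should be read as an illustration under stated assumptions rather than the paper's method. The field names (`IN_BYTES`, `FLOW_DURATION_MS`, `NOISE`), the synthetic data generator, and the bin count are all hypothetical.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in bits) between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in pxy.items():
        # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def discretize(values, bins=8):
    """Equal-width binning of a numeric column into integer bin indices."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0] * len(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


def rank_features(rows, labels, bins=8):
    """Return (feature name, MI score) pairs, most informative first."""
    scores = {}
    for name in rows[0]:
        column = [row[name] for row in rows]
        scores[name] = mutual_information(discretize(column, bins), labels)
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def make_synthetic_flows(n=2000, seed=0):
    """Toy flow records. Field names and distributions are invented for illustration."""
    rng = random.Random(seed)
    rows, labels = [], []
    for _ in range(n):
        attack = rng.random() < 0.3
        rows.append({
            "IN_BYTES": max(0.0, rng.gauss(200, 50) if attack else rng.gauss(1500, 400)),
            "FLOW_DURATION_MS": max(0.0, rng.gauss(20, 5) if attack else rng.gauss(300, 100)),
            "NOISE": rng.random(),  # unrelated to the label by construction
        })
        labels.append(int(attack))
    return rows, labels


rows, labels = make_synthetic_flows()
ranking = rank_features(rows, labels)
for name, score in ranking:
    print(f"{name}: {score:.3f} bits")
```

On this toy data the two label-dependent fields score far above the noise column, which is the behavior a filter method relies on when trimming a flow schema down to a short feature list. Real flow data would need the same ranking repeated across several datasets before any feature set could be called stable.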

Highlights

With the list of known network threats expanding every year, researchers and cybersecurity experts are constantly working on new safeguards and new tools of protection
The main contribution of this work is to cover the research gap related to identifying and investigating valuable features in the NetFlow schema that allow for effective, machine-learning-based network intrusion detection in the real world
Based on the popular flow-based data schemas, the research process presented in this paper addresses a research gap related to the verification of a list of features that contribute to network intrusion detection

Summary

Introduction

With the list of known network threats expanding every year, researchers and cybersecurity experts are constantly working on new safeguards and new tools of protection. Cybercriminals keep trying newer and more sophisticated tricks to steal sensitive or personal data or to cause damage to private businesses or government organizations [1,2]. To facilitate the use of machine learning to streamline network intrusion detection, good-quality labeled data need to be collected. This enables the use of highly accurate supervised learning techniques. Data-dependent algorithms are only as good as the data used to train them.


Journal: Entropy
Publication Date: Nov 18, 2021
Citations: 8
License type: CC BY 4.0
