Synthetic and privacy-preserving traffic trace generation using generative AI models for training Network Intrusion Detection Systems

Giuseppe Aceto, Fabio Giampaolo, Ciro Guida, Stefano Izzo, Antonio Pescapè, Francesco Piccialli, Edoardo Prezioso

doi:10.1016/j.jnca.2024.103926

Abstract

Network Intrusion Detection Systems (NIDS) are crucial tools for protecting networked devices from cyberattacks. Recent developments in the field of Artificial Intelligence (AI) have provided tremendous advantages in implementing NIDSs able to monitor network traffic and block cyberattacks in real time. It is widely recognized in the literature that effectively training a NIDS requires a large quantity of labeled traffic that is representative of attacks. Nonetheless, public and abundant datasets remain scarce, owing to the cost of gathering and labeling real traffic traces and to privacy concerns about sharing them. To tackle these challenges, in this paper we present a generative AI model capable of synthesizing anonymized traffic traces from real ones, thus addressing privacy, abundance, and representativeness. The proposal is based on a Conditional Variational Autoencoder (CVAE) and a preprocessing procedure specifically designed for the generation of new traffic traces. To validate our solution, we conduct an extensive empirical study leveraging three recent and publicly available datasets containing benign and malicious traffic. The validation considers both the classification performance of a robust NIDS and the quality of the synthetic data, each compared against the use of real data. We compare our CVAE with two state-of-the-art AI-based traffic data generators and show that a NIDS trained with traces emitted by our generative model suffers only a limited F1-score loss compared to training on real data; competing models instead struggle or fail to generate traces that are as effective for NIDS training and as statistically similar to the original. We make the synthetic datasets available in both PCAP and tabular formats, to facilitate the reproducibility of our findings and encourage further exploration in the field of generative AI for networking.

Journal of Network and Computer Applications
