A topological data analysis approach for detecting data poisoning attacks against machine learning based network intrusion detection systems

Galamo F Monkam, Michael J De Lucia, Nathaniel D Bastian

doi:10.1016/j.cose.2024.103929

Abstract

Data poisoning attacks pose a significant security risk to network security software that uses machine learning (ML) for network intrusion detection. As network traffic continues to grow, ML becomes indispensable for detecting and characterizing malicious actors attempting to infiltrate computer networks. However, conventional ML assumes a benign environment, leaving room for adversaries to violate this assumption during the training phase. Detecting data poisoning attacks is challenging because attackers make subtle alterations to the training data to create backdoors, trojans, or triggers. Traditional defenses often focus only on making the ML model more robust rather than on detecting the poisoned data, so new methods for identifying poisoned data are urgently needed to keep ML secure. We introduce a novel approach that combines topological data analysis and unsupervised learning to identify poisoned data early, before an ML model for network intrusion detection is trained. In our approach, extracting topological features and then applying clustering techniques produces new clusters composed exclusively of poisoned data, which can be removed before the ML model is trained.
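The abstract does not say which topological features or which clustering algorithm the authors use, so the following is only a toy sketch of the general idea and not the paper's pipeline. It assumes 0-dimensional persistent homology of a Vietoris-Rips filtration (whose death times equal the edge lengths of a minimum spanning tree) and cuts the tree at the largest jump in persistence. The function names, the 2-D synthetic records, and the "smallest cluster is suspect" rule are all hypothetical choices made for illustration.

```python
import math
import random


def h0_persistence(points):
    """Return MST edges (length, i, j) in ascending order of length.

    Each edge length is the death time of one connected component in a
    Vietoris-Rips filtration, i.e. the 0-dimensional persistence of the cloud.
    """
    n = len(points)
    candidates = sorted(
        (math.dist(points[i], points[j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    mst = []
    for length, i, j in candidates:
        ri, rj = find(i), find(j)
        if ri != rj:  # this edge merges two components, so one "dies" here
            parent[ri] = rj
            mst.append((length, i, j))
    return mst


def split_at_largest_gap(points):
    """Cluster by cutting the MST where consecutive death times jump most."""
    mst = h0_persistence(points)
    deaths = [length for length, _, _ in mst]
    gaps = [b - a for a, b in zip(deaths, deaths[1:])]
    cut = gaps.index(max(gaps)) + 1  # keep only edges before the largest jump
    parent = list(range(len(points)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for _, i, j in mst[:cut]:
        parent[find(i)] = find(j)
    clusters = {}
    for idx in range(len(points)):
        clusters.setdefault(find(idx), []).append(idx)
    return sorted(clusters.values(), key=len)  # smallest cluster first


# Synthetic stand-in for training records (assumption: two numeric features).
random.seed(0)
clean = [(random.random(), random.random()) for _ in range(40)]
# Hypothetical poisoned records: a tight group shifted by a "trigger" offset.
poisoned = [(5 + 0.1 * random.random(), 5 + 0.1 * random.random()) for _ in range(5)]
records = clean + poisoned

clusters = split_at_largest_gap(records)
suspected = clusters[0]  # assumption: the smallest cluster is the suspect one
print(sorted(suspected))
```

In this toy setup the poisoned group is far from the clean traffic, so the long-lived component separates cleanly; real poisoned records are designed to be subtle, which is why richer topological features and a tuned clustering step would be needed in practice.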

Journal: Computers & Security
