A Vote-Based Architecture to Generate Classified Datasets and Improve Performance of Intrusion Detection Systems Based on Supervised Learning

Diogo Teixeira, Pedro Pinto, Silvestre Malta

doi:10.3390/fi14030072

Abstract

An intrusion detection system (IDS) is an important tool for preventing potential threats to systems and data. Anomaly-based IDSs may deploy machine learning algorithms to classify events as either normal or anomalous and trigger an adequate response. When using supervised learning, these algorithms require classified, rich, and recent datasets. Thus, to foster the performance of these machine learning models, datasets can be generated from different sources in a collaborative approach and used to train multiple algorithms. This paper proposes a vote-based architecture to generate classified datasets and improve the performance of supervised learning-based IDSs. On a regular basis, multiple IDSs in different locations send their logs to a central system that combines and classifies them using different machine learning models and a majority vote system. The central system then generates a new, classified dataset, which is used for training to obtain the best updated model to be integrated into the IDSs of the companies involved. The proposed architecture trains multiple times with several algorithms. To shorten the overall runtimes, the proposed architecture was deployed in Fed4FIRE+ with Ray to distribute the tasks across the available resources. A set of machine learning algorithms and the proposed architecture were assessed. Compared with a baseline scenario, the proposed architecture increased accuracy by 11.5% and precision by 11.2%.
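The majority-vote labeling step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three rule-based "models", the record fields (`bytes`, `failed_logins`, `dst_port`), and the tie-breaking choice (ties count as anomalous) are all assumptions made for illustration, standing in for trained machine learning models.

```python
from collections import Counter
from typing import Callable, Dict, List, Sequence

Record = Dict[str, float]          # hypothetical log record layout
Model = Callable[[Record], str]    # a model maps a record to a label


def majority_vote(labels: Sequence[str], tie_label: str = "anomalous") -> str:
    """Return the most frequent label; ties fall back to tie_label (assumption)."""
    if not labels:
        raise ValueError("at least one vote is required")
    counts = Counter(labels)
    top = max(counts.values())
    winners = [label for label, n in counts.items() if n == top]
    return winners[0] if len(winners) == 1 else tie_label


def label_records(records: List[Record], models: Sequence[Model]) -> List[dict]:
    """Attach the voted label to each record, producing a classified dataset."""
    return [
        {**record, "label": majority_vote([model(record) for model in models])}
        for record in records
    ]


# Hypothetical stand-ins for trained models: simple threshold rules.
def volume_rule(r: Record) -> str:
    return "anomalous" if r["bytes"] > 10_000 else "normal"


def login_rule(r: Record) -> str:
    return "anomalous" if r["failed_logins"] >= 5 else "normal"


def port_rule(r: Record) -> str:
    return "anomalous" if r["dst_port"] not in (80, 443) else "normal"


logs = [
    {"bytes": 500, "failed_logins": 0, "dst_port": 443},
    {"bytes": 50_000, "failed_logins": 9, "dst_port": 443},
    {"bytes": 800, "failed_logins": 0, "dst_port": 2323},
]
dataset = label_records(logs, [volume_rule, login_rule, port_rule])
for row in dataset:
    print(row["label"])
```

With an odd number of voters and two classes a tie cannot occur, so the tie rule only matters when the number of models is even.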

Highlights

Published: 25 February 2022
Cyberattacks are constantly performed against companies and institutions, and these criminal activities may have different objectives, such as to disrupt services, steal confidential information, or perform extortion [1].
An intrusion detection system (IDS) is an important tool for a system administrator to prevent potential threats to systems and data, as it aims to detect attacks against information systems and protect these systems against malware and unauthorized access to a network or a system [3].
The IDSs located in a set of companies send their updated records, i.e., service logs (1), to a central system, which applies them to multiple models based on different algorithms.

Summary

Introduction

Cyberattacks are constantly performed against companies and institutions, and these criminal activities may have different objectives, such as disrupting services, stealing confidential information, or performing extortion [1]. [...] or systems’ behavior does not follow the normal behavior or a defined pattern [4]. These patterns and anomalies can be tested using machine learning algorithms. This paper proposes a centralized, vote-based architecture to generate classified datasets and improve the performance of supervised learning-based intrusion detection systems. The IDSs located in a set of companies send their updated records, i.e., service logs (1), to a central system (master), which applies them to multiple models based on different algorithms. Generating a dataset from recent and diverse records enriches it, improving the accuracy and precision of the IDSs over time. This architecture assumes the intensive and scalable training of models, which takes time and resources.
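The idea of training several candidate models in parallel and keeping the best one can be sketched as follows. This is a toy illustration under stated assumptions, not the paper's setup: Python's standard `concurrent.futures.ThreadPoolExecutor` stands in for Ray, single-feature threshold rules stand in for real learning algorithms, and the feature names and data are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

POSITIVE = "anomalous"


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def precision(y_true, y_pred, positive=POSITIVE):
    """Fraction of records predicted positive that are truly positive."""
    flagged = [t for t, p in zip(y_true, y_pred) if p == positive]
    return sum(t == positive for t in flagged) / len(flagged) if flagged else 0.0


def fit_threshold(feature, train):
    """Toy 'training': pick the threshold with the best training accuracy."""
    y_true = [label for _, label in train]

    def predict_all(th):
        return [POSITIVE if row[feature] >= th else "normal" for row, _ in train]

    candidates = sorted({row[feature] for row, _ in train})
    best = max(candidates, key=lambda th: accuracy(y_true, predict_all(th)))
    return lambda row: POSITIVE if row[feature] >= best else "normal"


def evaluate(feature, train, test):
    """Train one candidate and score it on held-out records."""
    model = fit_threshold(feature, train)
    y_true = [label for _, label in test]
    y_pred = [model(row) for row, _ in test]
    return {
        "feature": feature,
        "model": model,
        "accuracy": accuracy(y_true, y_pred),
        "precision": precision(y_true, y_pred),
    }


def select_best(features, train, test):
    """Evaluate all candidates concurrently and keep the most accurate one."""
    with ThreadPoolExecutor() as pool:
        results = list(pool.map(lambda f: evaluate(f, train, test), features))
    return max(results, key=lambda r: r["accuracy"])


# Hypothetical voted dataset: (record, label) pairs.
train = [
    ({"bytes": 100, "failed_logins": 0}, "normal"),
    ({"bytes": 120, "failed_logins": 1}, "normal"),
    ({"bytes": 110, "failed_logins": 7}, "anomalous"),
    ({"bytes": 130, "failed_logins": 9}, "anomalous"),
]
test = [
    ({"bytes": 140, "failed_logins": 0}, "normal"),
    ({"bytes": 90, "failed_logins": 8}, "anomalous"),
]
best = select_best(["bytes", "failed_logins"], train, test)
print(best["feature"], best["accuracy"], best["precision"])
```

In a Ray deployment each `evaluate` call would presumably become a remote task scheduled on the available nodes. The selection logic stays the same, since only the executor changes.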

Related Work

The Proposed Architecture

Results and Analysis


Journal: Future Internet	Publication Date: Feb 25, 2022
Citations: 6	License type: CC BY 4.0


