Intrusion Detection Model Based on TF.IDF and C4.5 Algorithms

Khaldoon Awadh, Ayhan Akbaş

doi:10.2339/politeknik.693221

Abstract

In recent years, machine learning and data mining technologies have drawn researchers’ attention as new ways to improve the performance of Intrusion Detection Systems (IDS). These techniques have proven effective at distinguishing malicious network packets. One of the most challenging problems researchers face is transforming data into a form that Machine Learning Algorithms (MLA) can handle effectively. In this paper, we present an IDS model based on the decision tree C4.5 algorithm, with a transformation of the simulated UNSW-NB15 dataset as a pre-processing step. Our model uses Term Frequency.Inverse Document Frequency (TF.IDF) to convert data types into a form that machine learning can process efficiently, in order to achieve high detection performance. The model was tested with 250,000 randomly selected records of the UNSW-NB15 dataset. The selected records were grouped into segments of various sizes, such as 50, 500, 1000, and 5000 items. Each segment was further divided into two subsets: a multi-class dataset and a binary-class dataset. The performance of the decision tree C4.5 algorithm was compared with that of Multilayer Perceptron (MLP) and Naive Bayes (NB) in Weka. Our proposed method significantly improved the accuracy of the classifiers and decreased the number of incorrectly detected instances. The increase in accuracy reflects the efficiency of transforming the dataset with TF.IDF across various segment sizes.
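The abstract does not spell out how TF.IDF is applied to network records, so the following is only a minimal sketch under stated assumptions: each segment of records is treated as a "document", each categorical value (for example a protocol name) as a "term", and every value is replaced by its TF.IDF weight within its segment. The function name `tfidf_encode`, the sample `protocols` column, and the unsmoothed logarithmic IDF are illustrative choices, not the authors' implementation.

```python
import math
from collections import Counter


def tfidf_encode(values, segment_size):
    """Replace each categorical value with a TF.IDF weight (illustrative sketch).

    Assumption: a segment of `segment_size` consecutive records plays the
    role of a document, and each distinct value plays the role of a term.
    """
    # Split the column into consecutive segments ("documents").
    segments = [values[i:i + segment_size]
                for i in range(0, len(values), segment_size)]
    n_segments = len(segments)

    # Document frequency: number of segments in which each value occurs.
    df = Counter()
    for seg in segments:
        df.update(set(seg))

    encoded = []
    for seg in segments:
        counts = Counter(seg)
        for v in seg:
            tf = counts[v] / len(seg)           # relative frequency in segment
            idf = math.log(n_segments / df[v])  # unsmoothed inverse document frequency
            encoded.append(tf * idf)
    return encoded


# Hypothetical categorical column, split into three segments of three records.
protocols = ["tcp", "tcp", "udp", "tcp", "tcp", "tcp", "udp", "arp", "tcp"]
weights = tfidf_encode(protocols, segment_size=3)
```

With this unsmoothed IDF, a value that occurs in every segment (here `tcp`) receives a weight of zero; a smoothed IDF variant would avoid that if such values need to stay distinguishable. The numeric column produced this way could then be exported for a classifier such as C4.5 in Weka.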

Journal: Politeknik Dergisi
Publication Date: Dec 1, 2021
Citations: 6

