Anomaly-based error and intrusion detection in tabular data: No DNN outperforms tree-based classifiers

Tommaso Zoppi, Stefano Gazzini, Andrea Ceccarelli

doi:10.1016/j.future.2024.06.051

Abstract

Recent years have seen growing involvement of researchers and practitioners in crafting Deep Neural Networks (DNNs) that seem to outperform existing machine learning approaches for solving classification problems such as anomaly-based error and intrusion detection. Undoubtedly, classifiers can differ widely from one another, and the choice among them typically depends on the specific task and target system. Designing and training the optimal tabular data classifier requires extensive experimentation, sensitivity analyses, big datasets, and domain-specific knowledge that may not be readily available or may be considered a non-strategic asset by many companies and stakeholders. This paper compares, using a total of 23 public datasets: i) traditional (tree-based, statistical) supervised classifiers, ii) DNNs that are specifically designed for classifying tabular data, and iii) DNNs for image classification that are applied to tabular data after converting data points into images, alone and as ensembles. Experimental results and related discussions show clear advantages in adopting tree-based classifiers for anomaly-based error and intrusion detection in tabular data, as they outperform their competitors, including DNNs. Then, individual classifiers are compared against ensembles that use different combinations of the classifiers considered in this study as base-learners and provide a unified final response through many meta-learning strategies. Results show that there is no benefit in building ensembles instead of using a tree-based classifier such as Random Forests, eXtreme Gradient Boosting, or Extra Trees. The paper concludes that anomaly-based error and intrusion detectors for critical systems should use the old (but gold) tree-based classifiers, which are also easier to fine-tune and understand; in addition, they require less time and fewer resources to learn their model.
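The abstract does not say which meta-learning strategies the ensembles use. As a rough illustration of how base-learner outputs can be merged into a unified final response, the sketch below applies plain majority voting, one simple and widely used strategy. The function names, the tie-breaking rule, and the toy predictions are all assumptions made for this example; none of them come from the paper or its datasets.

```python
from collections import Counter

# Assumed label convention: 0 = normal, 1 = anomaly (error or intrusion).


def majority_vote(base_predictions):
    """Combine per-classifier label lists into one label per data point.

    base_predictions: list of equal-length lists, one per base-learner.
    Ties go to the larger label (1, anomaly); this is an assumption of
    the sketch, not a rule taken from the paper.
    """
    combined = []
    for votes in zip(*base_predictions):
        counts = Counter(votes)
        top = max(counts.values())
        winners = [label for label, n in counts.items() if n == top]
        combined.append(max(winners))
    return combined


def accuracy(predicted, actual):
    """Fraction of data points whose predicted label matches the truth."""
    hits = sum(p == a for p, a in zip(predicted, actual))
    return hits / len(actual)


# Made-up toy predictions from three hypothetical base-learners.
truth = [0, 0, 1, 1, 0, 1]
tree_a = [0, 0, 1, 1, 0, 1]
tree_b = [0, 1, 1, 1, 0, 0]
dnn = [1, 0, 0, 1, 0, 0]

ensemble = majority_vote([tree_a, tree_b, dnn])
```

Calling `accuracy(ensemble, truth)` and `accuracy(tree_a, truth)` on these toy lists shows how an ensemble can be compared against a single base-learner. The numbers are illustrative only and say nothing about the paper's results.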

Journal: Future Generation Computer Systems	Publication Date: Jun 29, 2024
Citations: 2	License type: cc-by-nc-nd

Similar Papers

TLTD: Transfer Learning for Tabular Data
Maxim Bragilovski ... Shelly Levy-Tzedek
Applied Soft Computing | VOL. 147
14 Aug 2023

A Time Efficient Approach for Detecting Errors in Big Sensor Data on Cloud
Chi Yang ... Surya Nepal
IEEE Transactions on Parallel and Distributed Systems | VOL. 26
01 Feb 2015

An intrusion detection system for packet and flow based networks using deep neural network approach
Kaniz Farhana ... Maqsudur Rahman
International Journal of Electrical and Computer Engineering (IJECE) | VOL. 10
01 Oct 2020

Deep Neural Networks and Tabular Data: A Survey.
Vadim Borisov ... Tobias Leemann
IEEE Transactions on Neural Networks and Learning Systems | VOL. 35
01 Jun 2024
