New hybrid ensemble method for anomaly detection in data science

Amina Mohamed Elmahalwy,Khalid M Amin,Hayam M Mousa

doi:10.11591/ijece.v13i3.pp3498-3508

Amina Mohamed Elmahalwy, Khalid M Amin + Show 1 more

Open Access

https://doi.org/10.11591/ijece.v13i3.pp3498-3508

Copy DOI

Abstract

Anomaly detection is a significant research area in data science. Anomaly detection is used to find unusual points or uncommon events in data streams. It is gaining popularity not only in the business world but also in different of other fields, such as cyber security, fraud detection for financial systems, and healthcare. Detecting anomalies could be useful to find new knowledge in the data. This study aims to build an effective model to protect the data from these anomalies. We propose a new hyper ensemble machine learning method that combines the predictions from two methodologies the outcomes of isolation forest-k-means and random forest using a voting majority. Several available datasets, including KDD Cup-99, Credit Card, Wisconsin Prognosis Breast Cancer (WPBC), Forest Cover, and Pima, were used to evaluate the proposed method. The experimental results exhibit that our proposed model gives the highest realization in terms of receiver operating characteristic performance, accuracy, precision, and recall. Our approach is more efficient in detecting anomalies than other approaches. The highest accuracy rate achieved is 99.9%, compared to accuracy without a voting method, which achieves 97%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

New hybrid ensemble method for anomaly detection in data science

Abstract

Talk to us

Similar Papers

More From: International Journal of Electrical and Computer Engineering (IJECE)

Lead the way for us

Journal: International Journal of Electrical and Computer Engineering (IJECE)	Publication Date: Jun 1, 2023
License type: CC BY-SA 4.0

Similar Papers

Distributed Sequence Pattern Detection Over Multiple Data Streams
Ahmed Khan Leghari ... Jianneng Cao
-
Ahmed Khan Leghari, et. al.Ahmed Khan Leghari ... Jianneng Cao
01 Jan 2015
01 Jan 2015

A Family of Joint Sparse PCA Algorithms for Anomaly Localization in Network Data Streams
Ruoyi Jiang ... Jun Huan
IEEE Transactions on Knowledge and Data Engineering | VOL. 25
Ruoyi Jiang, et. al.Ruoyi Jiang ... Jun Huan
01 Nov 2013
IEEE Transactions on Knowledge and Data Engineering | VOL. 25

Data stream event prediction based on timing knowledge and state transitions
Yan Li ... Tingjian Ge
Proceedings of the VLDB Endowment | VOL. 13
Yan Li, et. al.Yan Li ... Tingjian Ge
01 Jun 2020
Proceedings of the VLDB Endowment | VOL. 13

Control-flow discovery from event streams
Andrea Burattin ... Alessandro Sperduti
-
Andrea Burattin, et. al.Andrea Burattin ... Alessandro Sperduti
01 Jul 2014
01 Jul 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

New hybrid ensemble method for anomaly detection in data science

Abstract

Talk to us

Similar Papers

More From: International Journal of Electrical and Computer Engineering (IJECE)