An iterative approach to unsupervised outlier detection using ensemble method and distance-based data filtering

Bodhan Chakraborty,Ram Sarkar,Samir Malakar,Agneet Chaterjee

doi:10.1007/s40747-022-00674-0

Bodhan Chakraborty, Ram Sarkar + Show 2 more

Open Access

https://doi.org/10.1007/s40747-022-00674-0

Copy DOI

Abstract

Outlier or anomaly detection is the process through which datum/data with different properties from the rest of the data is/are identified. Their importance lies in their use in various domains such as fraud detection, network intrusion detection, and spam filtering. In this paper, we introduce a new outlier detection algorithm based on an ensemble method and distance-based data filtering with an iterative approach to detect outliers in unlabeled data. The ensemble method is used to cluster the unlabeled data and to filter out potential isolated outliers from the same by iteratively using a cluster membership threshold until the Dunn index score for clustering is maximized. The distance-based data filtering, on the other hand, removes the potential outlier clusters from the post-clustered data based on a distance threshold using the Euclidean distance measure of each data point from the majority cluster as the filtering factor. The performance of our algorithm is evaluated by applying it to 10 real-world machine learning datasets. Finally, we compare the results of our algorithm to various supervised and unsupervised outlier detection algorithms using Precision@n and F-score evaluation metrics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Complex & Intelligent Systems	Publication Date: Feb 17, 2022
Citations: 9	License type: open-access

R Discovery Prime

R Discovery Prime

An iterative approach to unsupervised outlier detection using ensemble method and distance-based data filtering

Abstract

Talk to us

Similar Papers

More From: Complex & Intelligent Systems

Lead the way for us

Similar Papers

Interpretable Single-dimension Outlier Detection (ISOD): An Unsupervised Outlier Detection Method Based on Quantiles and Skewness Coefficients
Yuehua Huang ... Wen Chen
Applied Sciences | VOL. 14
Yuehua Huang, et. al.Yuehua Huang ... Wen Chen
22 Dec 2023
Applied Sciences | VOL. 14

Unsupervised Outlier Detection Mechanism for Tea Traceability Data
Honggang Yang ... Shaowen Li
IEEE Access | VOL. 10
Honggang Yang, et. al.Honggang Yang ... Shaowen Li
01 Jan 2021
IEEE Access | VOL. 10

Unsupervised Outlier Detection: A Meta-Learning Algorithm Based on Feature Selection
Vasilis Papastefanopoulos ... Sotiris Kotsiantis
Electronics | VOL. 10
Vasilis Papastefanopoulos, et. al.Vasilis Papastefanopoulos ... Sotiris Kotsiantis
12 Sep 2021
Electronics | VOL. 10

Anomaly Based Network Intrusion Detection with Unsupervised Outlier Detection
Jiong Zhang ... Mohammad Zulkernine
-
Jiong Zhang, et. al.Jiong Zhang ... Mohammad Zulkernine
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An iterative approach to unsupervised outlier detection using ensemble method and distance-based data filtering

Abstract

Talk to us

Similar Papers

More From: Complex &amp; Intelligent Systems

More From: Complex & Intelligent Systems