FOOR: Be Careful for Outlier-Score Outliers When Using Unsupervised Outlier Ensembles

Jiawei Yang, Sylwan Rahardja, Susanto Rahardja

doi:10.1109/tcss.2023.3280593

Abstract

Outlier detection is an important tool for analyzing patterns and detecting unexpected events in social systems. However, the process of outlier detection can be fraught with uncertainty, and it is difficult to determine the veracity of an object’s outlier score. We propose a framework for outlier-score outlier removal (FOOR). FOOR is a selection method that aims to remove inaccurate outlier scores before they are processed by ensemble techniques, in order to improve the accuracy of all ensembles. FOOR has been rigorously tested with 30 real-world datasets and seven state-of-the-art ensembles over 25 different base detectors. Simulated experiments showed that FOOR significantly improves the existing techniques, with an average (AVG) gain of +0.05 AUC (from 0.81 to 0.86 AUC). Thus, we recommend FOOR as the new standard for outlier-score preprocessing before ensembles.
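The abstract does not describe how FOOR decides which scores are inaccurate, so the following is only a rough sketch of the general idea of filtering outlying outlier scores before an averaging ensemble, not the authors' algorithm. It assumes each object has scores from several base detectors, already normalized to a common scale, and it uses a median-absolute-deviation rule as a stand-in selection criterion. The function name `filtered_average` and the cut-off `k` are hypothetical.

```python
import numpy as np


def filtered_average(scores, k=2.0):
    """Average each object's scores after dropping per-object score outliers.

    scores: array of shape (n_objects, n_detectors), assumed to be normalized
            to a common scale beforehand.
    k:      hypothetical cut-off in units of the median absolute deviation
            (MAD); values of k >= 1 guarantee at least one score is kept.
    NOTE: this MAD rule is an illustrative assumption, not the FOOR method.
    """
    scores = np.asarray(scores, dtype=float)
    # Per-object consensus and spread across detectors.
    median = np.median(scores, axis=1, keepdims=True)
    mad = np.median(np.abs(scores - median), axis=1, keepdims=True)
    # Keep only the scores that lie within k * MAD of the object's median.
    keep = np.abs(scores - median) <= k * mad
    kept_sum = np.where(keep, scores, 0.0).sum(axis=1)
    return kept_sum / keep.sum(axis=1)


# Two objects scored by five base detectors; one detector disagrees each time.
scores = np.array([
    [0.90, 0.80, 0.85, 0.10, 0.88],   # likely outlier, one detector misses it
    [0.10, 0.12, 0.11, 0.10, 0.95],   # likely inlier, one detector over-scores
])
combined = filtered_average(scores)
plain_mean = scores.mean(axis=1)
print(combined, plain_mean)
```

In this toy input the filtered average stays close to the majority of detectors, while the plain mean is pulled toward the single disagreeing score. Any other ensemble (maximum, weighted average, and so on) could be applied to the retained scores in the same way.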

Journal: IEEE Transactions on Computational Social Systems
Publication Date: Apr 1, 2024
Citations: 3

Similar Papers

Sparse Modeling-Based Sequential Ensemble Learning for Effective Outlier Detection in High-Dimensional Numeric Data
Guansong Pang ... Longbing Cao
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 32
29 Apr 2018

Modeling Outlier Score Distributions
Mohamed Bouguessa
01 Jan 2012

Outlier Detection on Mixed-Type Data: An Energy-Based Approach
Kien Do ... Truyen Tran
01 Jan 2015

Finding outliers using mutual nearness based ranks detection algorithm
Ram Niwas Gurjar ... Neeraj Sharma
01 Feb 2014
