Detection of local and clustered outliers based on the density–distance decision graph

Kangsheng Li, Xin Gao, Xin Jia, Bing Xue, Shiyuan Fu, Zhiyu Liu, Xu Huang, Zijian Huang

doi:10.1016/j.engappai.2022.104719

Abstract

Outlier detection aims to identify objects whose characteristics differ from those of normal observations. Most existing approaches detect outliers from a global perspective: they effectively detect global outliers and most clustered outliers, but cannot detect local outliers when the normal samples form clusters with different densities. Methods based on local outlier factors effectively detect local outliers, but as the number of outliers increases, the growing share of clustered outliers degrades their detection performance. We propose an outlier detection method based on the density–distance decision graph that detects local, global and clustered outliers simultaneously. First, kernel density estimation and the local reachable distance are combined to calculate the local density. The ratio of the density of an instance's neighbors to the density of the instance itself is used as the degree of local outlierness. Then, we propose a metric named density lifting distance as the degree of global outlierness, calculated from the distances between an instance and its k nearest neighbors that have higher density. The density ratio and the density lifting distance are combined to draw the density–distance decision graph, and the product of the two metrics is taken as the final outlier score. Comprehensive experiments were conducted on 8 synthetic datasets and 16 real-world datasets, comparing against 12 state-of-the-art methods. The results show that the proposed method works well both when the samples form clusters with different densities and when the percentage of outliers varies, and that it outperforms the tested state-of-the-art methods in terms of AUC.
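
The abstract names the ingredients of the score but not their exact formulas, so the following Python sketch is only one possible reading of the pipeline, not the authors' implementation. The Gaussian kernel, the global bandwidth (median k-distance), the LOF-style reachability distance, the density floor `eps`, and the fallback for the densest point are all assumptions introduced for illustration, as is the function name `outlier_scores`.

```python
import math
import statistics


def gaussian(r, h):
    """Unnormalized Gaussian kernel value for distance r and bandwidth h."""
    z = r / h
    return math.exp(-0.5 * z * z)


def outlier_scores(points, k=3, eps=1e-12):
    """Toy score: density ratio times density lifting distance (illustrative)."""
    n = len(points)
    if n <= k:
        raise ValueError("need more than k points")

    # Pairwise Euclidean distances.
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Indices of the k nearest neighbors of each point (excluding itself).
    knn = [
        sorted((j for j in range(n) if j != i), key=dist[i].__getitem__)[:k]
        for i in range(n)
    ]
    # k-distance: distance from each point to its k-th nearest neighbor.
    k_dist = [dist[i][knn[i][-1]] for i in range(n)]

    # ASSUMPTION: one global bandwidth, the median k-distance.
    h = max(statistics.median(k_dist), eps)

    density = []
    for i in range(n):
        # ASSUMPTION: LOF-style reachability distance from i to neighbor j.
        reach = [max(dist[i][j], k_dist[j]) for j in knn[i]]
        kde = sum(gaussian(r, h) for r in reach) / k
        density.append(max(kde, eps))  # floor avoids division by zero

    scores = []
    for i in range(n):
        # Local degree: mean neighbor density divided by own density.
        ratio = sum(density[j] for j in knn[i]) / (k * density[i])

        # Global degree: mean distance to the k nearest strictly denser points.
        denser = sorted(
            dist[i][j] for j in range(n) if density[j] > density[i]
        )[:k]
        if not denser:
            # ASSUMPTION: the densest point falls back to its ordinary neighbors.
            denser = [dist[i][j] for j in knn[i]]
        lifting = sum(denser) / len(denser)

        # Final score: product of the two degrees, as the abstract describes.
        scores.append(ratio * lifting)
    return scores


# Toy data: a tight 3x3 grid plus one isolated point (index 9).
points = [(i * 0.1, j * 0.1) for i in range(3) for j in range(3)] + [(5.0, 5.0)]
scores = outlier_scores(points, k=3)
print(max(range(len(scores)), key=scores.__getitem__))
```

On this toy input the isolated point receives the largest score. Plotting the density ratio against the lifting distance for each point would give a rough analogue of the decision graph mentioned in the abstract.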

Journal: Engineering Applications of Artificial Intelligence
Publication Date: Feb 11, 2022
Citations: 16

Similar Papers

A novel outlier detecting algorithm based on the outlier turning points
Jinlong Huang ... Sulan Zhang
Expert Systems with Applications | VOL. 231
01 Nov 2023

Coulomb’s law-inspired parameter-free outlier detection algorithm
Rui Pu ... Dongming Tang
Applied Soft Computing | VOL. -
01 Oct 2024

An ensemble-based outlier detection method for clustered and local outliers with differential potential spread loss
Xin Gao ... Guangyao Zhang
Knowledge-Based Systems | VOL. 258
17 Oct 2022

A hybrid algorithm for mining local outliers in categorical data
Meiling Liu ... Weidong Tang
International Journal of Wireless and Mobile Computing | VOL. 13
01 Jan 2017
