OutRank: ranking outliers in high dimensional data

Emmanuel Muller, Thomas Seidl, Ira Assent, Uwe Steinhausen

doi:10.1109/icdew.2008.4498387

Abstract

Outlier detection is an important data mining task for consistency checks, fraud detection, etc. Binary decision making on whether or not an object is an outlier is not appropriate in many applications and moreover hard to parametrize. Thus, recently, methods for outlier ranking have been proposed. Determining the degree of deviation, they do not require setting a decision boundary between outliers and the remaining data. High dimensional and heterogeneous (continuous and categorical attributes) data, however, pose a problem for most outlier ranking algorithms. In this work, we propose our OutRank approach for ranking outliers in heterogeneous high dimensional data. We introduce a consistent model for different attribute types. Our novel scoring functions transform the analyzed structure of the data to a meaningful ranking. Promising results in preliminary experiments show the potential for successful outlier ranking in high dimensional data.

Similar Papers

Detecting and ranking outliers in high-dimensional data
Amardeep Kaur ... Amitava Datta
International Journal of Advances in Engineering Sciences and Applied Mathematics | VOL. 11
14 Dec 2018

Outliers in High Dimensional Data
N N R Ranga Suri ... G Athithan
01 Jan 2019

Improving the Accuracy of Convolutional Neural Networks by Identifying and Removing Outlier Images in Datasets Using t-SNE
Husein Perez ... Joseph H M Tah
Mathematics | VOL. 8
27 Apr 2020

An Efficient Method to Detect Outliers in High Dimensional Data
Atul Garg ...
Journal of Computational and Theoretical Nanoscience | VOL. 16
01 Sep 2019
