A Semisupervised Approach to the Detection and Characterization of Outliers in Categorical Data.

Dino Ienco,Ruggero G Pensa,Rosa Meo

doi:10.1109/tnnls.2016.2526063

Abstract

In this paper, we introduce a new approach of semisupervised anomaly detection that deals with categorical data. Given a training set of instances (all belonging to the normal class), we analyze the relationship among features for the extraction of a discriminative characterization of the anomalous instances. Our key idea is to build a model that characterizes the features of the normal instances and then use a set of distance-based techniques for the discrimination between the normal and the anomalous instances. We compare our approach with the state-of-the-art methods for semisupervised anomaly detection. We empirically show that a specifically designed technique for the management of the categorical data outperforms the general-purpose approaches. We also show that, in contrast with other approaches that are opaque because their decision cannot be easily understood, our proposed approach produces a discriminative model that can be easily interpreted and used for the exploration of the data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Feb 17, 2016
Citations: 67	License type: other-oa

R Discovery Prime

R Discovery Prime

A Semisupervised Approach to the Detection and Characterization of Outliers in Categorical Data.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Similar Papers

An algorithm for mining outliers in categorical data through ranking
N N R Ranga Suri ... G Athithan
-
N N R Ranga Suri, et. al.N N R Ranga Suri ... G Athithan
01 Dec 2012
01 Dec 2012

A ranking-based algorithm for detection of outliers in categorical data
N.N.R Ranga Suri ... M Narasimha Murty
International Journal of Hybrid Intelligent Systems | VOL. 11
N.N.R Ranga Suri, et. al.N.N.R Ranga Suri ... M Narasimha Murty
29 Nov 2013
International Journal of Hybrid Intelligent Systems | VOL. 11

A Rough Clustering Algorithm for Mining Outliers in Categorical Data
N N R Ranga Suri ... Gopalasamy Athithan
-
N N R Ranga Suri, et. al.N N R Ranga Suri ... Gopalasamy Athithan
01 Jan 2013
01 Jan 2013

Outlier Analysis of Categorical Data using NAVF
D Lakshmi Sreenivasa Reddy ... A Govardhan
Informatica Economica | VOL. 17
D Lakshmi Sreenivasa Reddy, et. al.D Lakshmi Sreenivasa Reddy ... A Govardhan
30 Mar 2013
Informatica Economica | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Semisupervised Approach to the Detection and Characterization of Outliers in Categorical Data.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems