Combining Entropy Measures for Anomaly Detection

Alberto Muñoz, Gabriel Martos, Nicolás Hernández, Javier M Moguerza

doi:10.3390/e20090698

Abstract

The combination of different sources of information is a problem that arises in several situations, for instance, when data are analysed using different similarity measures. Often, each source of information is given as a similarity, distance, or a kernel matrix. In this paper, we propose a new class of methods which consists of producing, for anomaly detection purposes, a single Mercer kernel (that acts as a similarity measure) from a set of local entropy kernels and, at the same time, avoids the task of model selection. This kernel is used to build an embedding of data in a variety that will allow the use of a (modified) one-class Support Vector Machine to detect outliers. We study several information combination schemes and their limiting behaviour when the data sample size increases within an Information Geometry context. In particular, we study the variety of the given positive definite kernel matrices to obtain the desired kernel combination as belonging to that variety. The proposed methodology has been evaluated on several real and artificial problems.

Highlights

Usual Data Mining tasks, such as classification, regression and anomaly detection, are heavily dependent on the geometry of the underlying data space.
We explore linear combinations and Karcher means to validate the intuition that a mean more natural than the arithmetic mean will produce better practical results where positive definite matrices are involved.
We have explored how to combine different sources of information for anomaly detection within the framework of Entropy measures.

Summary

Introduction

Usual Data Mining tasks, such as classification, regression and anomaly detection, are heavily dependent on the geometry of the underlying data space. Support Vector Machines (SVM) provide control over the data space geometry through the use of a Mercer kernel function [1,2]. The choice of the appropriate kernel, including its parameters, is a particular case of the model selection problem. A typical way to proceed is by means of cross-validation procedures [5]. These parameter calibration strategies, although intuitive and simple from an applied point of view, have some important drawbacks. An appealing alternative to model selection when working with SVM is to combine or merge different kernel functions into a single kernel [6,7]. The paper is organized as follows: Section 2 describes the functional data analysis methods used to produce the data representations from kernels, as well as the minimum entropy method used in this paper for anomaly detection.
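As a rough illustration of merging several kernels into a single one, the sketch below builds two Gaussian Gram matrices with different bandwidths and combines them in two ways: by the arithmetic mean and by the closed-form geometric mean of two positive definite matrices, which coincides with their Karcher mean. This is not the authors' method. The Gaussian kernels stand in for the paper's local entropy kernels, and the function names, bandwidths, ridge term and toy data are all assumptions made for the example.

```python
import numpy as np

def rbf_gram(X, sigma):
    """Gaussian (RBF) Gram matrix for the rows of X with bandwidth sigma."""
    sq = np.sum(X**2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.exp(-d2 / (2.0 * sigma**2))

def spd_power(A, p):
    """A**p for a symmetric positive definite matrix, via eigendecomposition."""
    w, V = np.linalg.eigh(A)
    return (V * w**p) @ V.T

def arithmetic_mean(kernels):
    """Plain average of a list of kernel matrices."""
    return sum(kernels) / len(kernels)

def geometric_mean(A, B):
    """Karcher mean of two SPD matrices: A^{1/2} (A^{-1/2} B A^{-1/2})^{1/2} A^{1/2}."""
    A_half = spd_power(A, 0.5)
    A_inv_half = spd_power(A, -0.5)
    M = A_inv_half @ B @ A_inv_half
    M = (M + M.T) / 2.0                  # remove rounding asymmetry
    G = A_half @ spd_power(M, 0.5) @ A_half
    return (G + G.T) / 2.0

# Toy data and bandwidths (hypothetical values chosen for the example).
rng = np.random.default_rng(0)
X = rng.normal(size=(8, 2))
ridge = 1e-4 * np.eye(len(X))            # assumed ridge keeping both matrices strictly positive definite
K1 = rbf_gram(X, 0.5) + ridge
K2 = rbf_gram(X, 1.0) + ridge

K_arith = arithmetic_mean([K1, K2])
K_geo = geometric_mean(K1, K2)
```

Both results are symmetric positive definite, so either could be passed to a kernel method that accepts a precomputed Gram matrix. For more than two matrices the Karcher mean has no closed form in general and is usually computed iteratively.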

Reproducing Kernel Hilbert Spaces for Multivariate and Functional Data

Local Entropy Kernels

Kernel Combination for Anomaly Detection

Entropy Weighting

Karcher Mean

Experimental Section

Synthetic Data

Real Data

Robustness of the Karcher Mean

Discussion

Journal: Entropy
Publication Date: Sep 12, 2018
License type: CC BY 4.0
