Abstract

Classification tasks usually assume that all possible classes are present during the training phase. This is restrictive if the algorithm is used over a long time and possibly encounters samples from unknown new classes. It is therefore fundamental to develop algorithms able to distinguish between normal and abnormal test data. In the last few years, extreme value theory has become an important tool in multivariate statistics and machine learning. The recently introduced extreme value machine, a classifier motivated by extreme value theory, addresses this problem and achieves competitive performance in specific cases. We show that this algorithm has some theoretical and practical drawbacks and can fail even if the recognition task is fairly simple. To overcome these limitations, we propose two new algorithms for anomaly detection relying on approximations from extreme value theory that are more robust in such cases. We exploit the intuition that test points that are extremely far from the training classes are more likely to be abnormal objects. We derive asymptotic results motivated by univariate extreme value theory that make this intuition precise. We show the effectiveness of our classifiers in simulations and on real data sets.
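The intuition described in the abstract, that a test point lying extremely far from the training classes is likely abnormal, can be illustrated with a peaks-over-threshold construction. The sketch below is only an assumption-laden illustration (toy Gaussian data, nearest-neighbour distances, an arbitrary 90% threshold, and SciPy's genpareto), not the paper's actual GPDC algorithm: it fits a generalized Pareto distribution to the largest distances observed within the training class and flags a test point whose distance has a very small tail probability.

```python
# Illustrative peaks-over-threshold anomaly score; all data, thresholds and
# design choices below are assumptions for demonstration, not the paper's method.
import numpy as np
from scipy.stats import genpareto
from scipy.spatial.distance import cdist

rng = np.random.default_rng(0)

# "Normal" training class and a held-out test point (toy data).
X_train = rng.normal(loc=0.0, scale=1.0, size=(500, 2))
x_test = np.array([[6.0, 6.0]])          # far from the training class

# Distance of every training point to its nearest neighbour in the class;
# the large values form the upper tail modelled by a generalized Pareto law.
d_train = cdist(X_train, X_train)
np.fill_diagonal(d_train, np.inf)
nn_dist = d_train.min(axis=1)

# Peaks over threshold: keep exceedances above a high empirical quantile.
u = np.quantile(nn_dist, 0.90)           # threshold choice is an assumption
excess = nn_dist[nn_dist > u] - u

# Fit the GPD to the excesses (location fixed at 0).
shape, _, scale = genpareto.fit(excess, floc=0.0)

# Score the test point: tail probability of observing a distance this large.
d_test = cdist(x_test, X_train).min()
if d_test <= u:
    p_tail = 1.0                         # not in the tail, treated as normal
else:
    p_tail = (nn_dist > u).mean() * genpareto.sf(d_test - u, shape, loc=0.0, scale=scale)

print(f"tail probability: {p_tail:.2e}")  # a very small value suggests an anomaly
```

A small tail probability can then be thresholded to decide whether the test point belongs to a known class or should be rejected as abnormal; the choice of that final threshold is again an assumption of this sketch.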

Highlights

  • Modern classifiers achieve human or super-human performance in a variety of tasks (Christopher 2016), including speech (Graves et al 2013) and image recognition (He et al 2016), but they are typically not able to discriminate between normal and abnormal classes and may give high-confidence predictions for unrecognizable objects

  • We present two new kernel-free algorithms that perform anomaly detection using extreme value theory

  • These algorithms, called the generalized Pareto distribution (GPD) classifier (GPDC) and the generalized extreme value (GEV) classifier (GEVC), are fast to update with the arrival of new data and easy to adapt to an incremental framework (see the sketch after this list)
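
To make the incremental claim in the last bullet concrete, the sketch below shows one plausible way to keep such a tail model up to date: only the exceedances over a fixed high threshold are stored, and the GPD fit is refreshed when new data arrive. The IncrementalTailModel class, the threshold value and the refit rule are illustrative assumptions, not the paper's GPDC/GEVC implementation.

```python
# Minimal sketch of why a GPD tail fit is cheap to keep up to date: only the
# stored exceedances need to be refreshed when new samples arrive. This class
# is an illustrative assumption, not the paper's GPDC implementation.
import numpy as np
from scipy.stats import genpareto


class IncrementalTailModel:
    def __init__(self, threshold):
        self.threshold = threshold       # fixed high threshold (assumed given)
        self.excesses = []               # stored exceedances over the threshold
        self.params = None               # (shape, scale) of the fitted GPD

    def update(self, distances):
        """Add new distance observations and refit the tail."""
        new_exc = [d - self.threshold for d in distances if d > self.threshold]
        self.excesses.extend(new_exc)
        if len(self.excesses) >= 10:     # refit once enough tail data is available
            shape, _, scale = genpareto.fit(np.asarray(self.excesses), floc=0.0)
            self.params = (shape, scale)

    def tail_prob(self, d):
        """Survival probability of a new distance d under the fitted tail."""
        if self.params is None or d <= self.threshold:
            return 1.0
        shape, scale = self.params
        return genpareto.sf(d - self.threshold, shape, loc=0.0, scale=scale)


# Usage: stream batches of distances and query the model between batches.
rng = np.random.default_rng(1)
model = IncrementalTailModel(threshold=2.5)
for _ in range(5):
    model.update(rng.exponential(scale=1.0, size=200))
print(model.tail_prob(8.0))              # a small value flags a likely anomaly
```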

Summary

Introduction

Modern classifiers achieve human or super-human performance in a variety of tasks (Christopher 2016), including speech (Graves et al 2013) and image recognition (He et al 2016), but they are typically not able to discriminate between normal and abnormal classes and may give high-confidence predictions for unrecognizable objects. We underline that in this context standard hyperparameter optimization procedures such as cross-validation are usually not available, since the training set contains only normal objects. For this reason, an algorithm designed for anomaly detection should involve as few hyperparameters as possible, and we propose an alternative approach based on extreme value theory that overcomes this problem.

Related work
Extreme value theory
General setting
Algorithm description
Limitations of the EVM
The GPD classifier
Extreme value theory and anomaly detection
The GPDC algorithm
The GEV classifier
Application
Simulated data
OLETTER protocol
Diagnostics of thyroid disease
Findings
Conclusion
