Abstract

Novelty detection is a particular example of pattern recognition identifying patterns that departure from some model of “normal behaviour”. The classification of point patterns is considered that are defined as sets of N observations of a multivariate random variable X and where the value N follows a discrete stochastic distribution. The use of point process models is introduced that allow us to describe the length N as well as the geometrical configuration in data space of such patterns. It is shown that such infinite dimensional study can be translated into a one-dimensional study that is analytically tractable for a multivariate Gaussian distribution. Moreover, for other multivariate distributions, an analytic approximation is obtained, by the use of extreme value theory, to model point patterns that occur in low-density regions as defined by X. The proposed models are demonstrated on synthetic and real-world data sets.

Highlights

  • Novelty detection is the task of recognising test data that differ in some respect from the data that were available during training (Bishop, 1994); it is typically used when there is a large quantity of ‘‘normal’’ data available, but an insufficient quantity of ‘‘abnormal’’ data, preventing accurate estimation of the ‘‘abnormal’’ class in a two-class classification setting (Nitesh, 2005)

  • We provide an analytic approximation using extreme value theory (EVT) to model those point patterns that are situated in regions where the density defined by X is low

  • It can be shown that when the limit in (8) holds for the random variable Z for some scale parameter σ, the corresponding sequences of point process models (PPMs) of exceedancesZne will converge to a Poisson point process (PPP) for large u, meaning that the corresponding sequence of counting measures NAn associated withZne converge in distribution to a Poisson distribution: NAn d

Read more

Summary

Introduction

Novelty detection is the task of recognising test data that differ in some respect from the data that were available during training (Bishop, 1994); it is typically used when there is a large quantity of ‘‘normal’’ data available, but an insufficient quantity of ‘‘abnormal’’ data, preventing accurate estimation of the ‘‘abnormal’’ class in a two-class classification setting (Nitesh, 2005). The method introduced in this article is based on the development of point process models (PPMs) and provides a valid probabilistic interpretation of the degree of novelty of an entire point pattern x. This approach enables us to prevent those. We provide an analytic approximation using extreme value theory (EVT) to model those point patterns that are situated in regions where the density defined by X is low (the ‘extremes’ of the pattern) These regions are of particular importance because the decision boundary for novelty detection is typically situated at the edge of the support of X , where the density is low.

Problem setting
Novelty detection
Extreme value theory
Point process models
Point processes on a Euclidean space
Informal definition
Distribution and intensity measure
Finite point processes
Point processes of exceedances
Novelty detection for point patterns
Distributions of Janossy densities
Point patterns of multivariate normal distributions
Examples and special cases
Multivariate point patterns of exceedances
Experimental results
Capability of industrial processes
Predictive monitoring of patients
Online novelty detection
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.