Learning to rank anomalies: scalar performance criteria and maximization of rank statistics

Myrto Limnios,Nathan Noiry,Stephan Clémençon

doi:10.1007/s10994-024-06609-9

Abstract

AbstractThe ability to collect and store ever more massive data, unlabeled in many cases, has been accompanied by the need to process them efficiently in order to extract relevant information and possibly design solutions based on the latter. In various situations, the vast majority of the observations exhibit the same behavior, while a small proportion deviates from it. Detecting these outlier observations (or equivalently defined as anomalies) is now one of the major challenges for machine learning applications (e.g. fraud detection or predictive maintenance). We propose here a novel methodology for outlier/anomaly detection, by learning a scoring function defined on the feature space allowing for ranking the observations by degree of abnormality. The scoring function is built through maximization of an empirical performance criterion taking the form of a (two-sample) linear rank statistic. We show that bipartite ranking algorithms can thus be used to learn nearly optimal scoring function with provable theoretical guarantees. We illustrate our methodology with numerical experiments based on open access online code.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning to rank anomalies: scalar performance criteria and maximization of rank statistics

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Journal: Machine Learning	Publication Date: Oct 21, 2024
License type: CC BY 4.0

Similar Papers

Local Asymptotics for Linear Rank Statistics with Estimated Score Functions
Georg Neuhaus
The Annals of Statistics | VOL. 15
Georg NeuhausGeorg Neuhaus
01 Jun 1987
The Annals of Statistics | VOL. 15

Applications
Konrad Behnen ... Georg Neuhaus
-
Konrad Behnen, et. al.Konrad Behnen ... Georg Neuhaus
01 Jan 1989
01 Jan 1989

Combining Alternative Rank Tests for the Multiple Regression Problem
Shoutir Kishore Chatterjee ... Tathagata Banerjee
Calcutta Statistical Association Bulletin | VOL. 35
Shoutir Kishore Chatterjee, et. al.Shoutir Kishore Chatterjee ... Tathagata Banerjee
01 Sep 1986
Calcutta Statistical Association Bulletin | VOL. 35

Rank-based testing of equal survivorship based on cross-sectional survival data with or without prospective follow-up.
Kwun Chuen Gary Chan ... Jing Qin
Biostatistics (Oxford, England) | VOL. 16
Kwun Chuen Gary Chan, et. al.Kwun Chuen Gary Chan ... Jing Qin
25 Mar 2015
Biostatistics (Oxford, England) | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning to rank anomalies: scalar performance criteria and maximization of rank statistics

Abstract

Talk to us

Similar Papers

More From: Machine Learning