Abstract

Anomaly detection (AD) in images is a fundamental computer vision problem and refers to identifying images that deviate significantly from normality. State-of-the-art AD algorithms commonly learn a model of normality from scratch using task-specific datasets in either semisupervised or self-supervised manner. We follow an alternative approach and model the distribution of normal data in deep feature representations learned from ImageNet via a multivariate Gaussian (MVG). This lightweight approach achieves a new state of the art in AD on the public MVTec AD dataset. In addition to the empirical benefits, we give a clear motivation for the seemingly simplistic approach via the ties between deep generative and discriminative modeling revealed recently. We further elucidate why ImageNet representations are discriminative in the transfer learning AD setting using the principal component analysis. Here, we find that the principal components containing little variance in normal data are the ones crucial for discriminating between normal and anomalous instances, giving an explanation for the unreasonable effectiveness of our approach. We also investigate setting the working point of our approach by selecting acceptable false-positive rate thresholds based on the MVG assumption and the resistance of our approach to unlabeled anomalies in the dataset. Finally, we investigate whether our approach is prone to exploiting spurious correlations using explainable AI techniques. Code is publicly available at https://github.com/ORippler/gaussian-ad-mvtec.

Highlights

  • A NOMALY Detection (AD) relates to identifying instances in data that are significantly different to the norm [1]–[3]

  • We further show that the working point can be sensibly set based on choosing an acceptable False Positive Rate (FPR) under the multivariate Gaussian (MVG) assumption, which contrasts the empirical, heuristics-based working point estimation commonly applied in literature [4]

  • 2) Results: Assessing performance results, it becomes apparent that our proposed Gaussian anomaly detector achieves a new state of the art on the public MVTec AD dataset

Read more

Summary

Introduction

A NOMALY Detection (AD) relates to identifying instances in data that are significantly different to the norm [1]–[3]. AD tasks are defined by the following two characteristics:. Anomalies are rare events, i.e. their prevalence in the application domain is low. There exists limited knowledge about the anomaly distribution, i.e. it is not well-defined.. There exists limited knowledge about the anomaly distribution, i.e. it is not well-defined.1 Together, these characteristics result in AD datasets that are small and heavily imbalanced, often containing only few anomalies for model verification and testing. While ImageNet representations have been successfully used for AD [17]–[19], the ties between deep generative and discriminative models recently unveiled by [20] have not yet been leveraged to induce a strong prior for the normal data distribution

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call