Abstract

Aims.We present the first piece of evidence that adaptive learning techniques can boost the discovery of unusual objects within astronomical light curve data sets.Methods.Our method follows an active learning strategy where the learning algorithm chooses objects that can potentially improve the learner if additional information about them is provided. This new information is subsequently used to update the machine learning model, allowing its accuracy to evolve with each new piece of information. For the case of anomaly detection, the algorithm aims to maximize the number of scientifically interesting anomalies presented to the expert by slightly modifying the weights of a traditional isolation forest (IF) at each iteration. In order to demonstrate the potential of such techniques, we apply the Active Anomaly Discovery algorithm to two data sets: simulated light curves from the Photometric LSST Astronomical Time-series Classification Challenge (PLAsTiCC) and real light curves from the Open Supernova Catalog. We compare the Active Anomaly Discovery results to those of a static IF. For both methods, we performed a detailed analysis for all objects with the ∼2% highest anomaly scores.Results.We show that, in the real data scenario, Active Anomaly Discovery was able to identify ∼80% more true anomalies than the IF. This result is the first piece of evidence that active anomaly detection algorithms can play a central role in the search for new physics in the era of large-scale sky surveys.

Highlights

  • The detection of new astronomical sources is one of the most anticipated outcomes of the generation of large-scale sky surveys

  • Our goal with the real data analysis is to lower the burden inflicted by the machine learning (ML) algorithms on domain experts and propose a strategy that would improve the results presented in Pruzhinskaya et al (2019) while requiring the expert to confirm a lower number of sources

  • This figure was created considering objects in decreasing order of anomaly scores and following the order in which they were presented as candidates

Read more

Summary

Introduction

The detection of new astronomical sources is one of the most anticipated outcomes of the generation of large-scale sky surveys. The task of automatically identifying peculiar objects within a large set of normal instances has been highly explored in many areas of research (Aggarwal 2016). This has led to the development of a number of machine learning (ML) algorithms for anomaly detection (AD) with a large range of applications (Mehrotra et al 2017). These techniques have largely been applied to areas such as the identification

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call