Abstract

The latest Fermi-LAT gamma-ray catalogue, 4FGL-DR3, contains a large fraction of sources without a clear association to known counterparts, i.e. unidentified sources (unIDs). In this paper, we aim to classify them using machine learning algorithms, which are trained on the spectral characteristics of associated sources to predict the class of the unID population. With the state-of-the-art CatBoost algorithm, based on gradient-boosted decision trees, we reach an accuracy of 67 per cent on a 23-class data set. Removing a single one of these classes, blazars of uncertain type, increases the accuracy to 81 per cent. When only a binary AGN/pulsar distinction is of interest, the model accuracy rises to 99 per cent. Additionally, we perform an unsupervised search among both the known and unID populations, and try to predict the number of clusters of similar sources without prior knowledge of their classes. The full code used to perform all calculations is provided as an interactive Python notebook.
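The three accuracy figures quoted above refer to three different scoring set-ups: all classes, all classes except blazars of uncertain type, and a binary AGN/pulsar split. The sketch below illustrates only how such scores relate to one another. It is a toy example under stated assumptions and is not the code from the notebook: the label lists are invented, the helper names (`accuracy`, `without_class`, `to_binary`, `BROAD_CLASS`) are hypothetical, and only five catalogue class codes are used (`bll`, `fsrq`, `bcu`, `psr`, `msp`).

```python
# Toy illustration: the labels below are invented for this sketch and are
# NOT taken from 4FGL-DR3 or from the paper's results.

# Hypothetical grouping of class codes into the two broad families.
BROAD_CLASS = {
    "bll": "agn",
    "fsrq": "agn",
    "bcu": "agn",
    "psr": "pulsar",
    "msp": "pulsar",
}


def accuracy(true_labels, predicted_labels):
    """Return the fraction of sources whose predicted class is correct."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("cannot score an empty sample")
    matches = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return matches / len(true_labels)


def without_class(true_labels, predicted_labels, excluded):
    """Drop every source whose true class equals `excluded`."""
    kept = [(t, p) for t, p in zip(true_labels, predicted_labels) if t != excluded]
    return [t for t, _ in kept], [p for _, p in kept]


def to_binary(labels):
    """Map detailed class codes onto the broad AGN/pulsar families."""
    return [BROAD_CLASS[label] for label in labels]


# Invented true and predicted classes for ten imaginary sources.
true_labels = ["bll", "fsrq", "bcu", "bcu", "psr", "msp", "bll", "fsrq", "psr", "bcu"]
predicted_labels = ["bll", "fsrq", "bll", "fsrq", "psr", "psr", "bll", "bll", "psr", "bcu"]

acc_all = accuracy(true_labels, predicted_labels)
acc_no_bcu = accuracy(*without_class(true_labels, predicted_labels, "bcu"))
acc_binary = accuracy(to_binary(true_labels), to_binary(predicted_labels))

print(f"all classes:      {acc_all:.2f}")
print(f"without bcu:      {acc_no_bcu:.2f}")
print(f"binary AGN/pulsar: {acc_binary:.2f}")
```

Note that this sketch re-scores one fixed set of toy predictions, whereas the figures in the abstract come from models trained on each respective label set. It shows only why coarser or cleaner label sets tend to yield higher accuracy: confusions between closely related classes stop counting as errors.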
