Masked particle modeling on sets: towards self-supervised high energy physics foundation models

Tobias Golling, Lukas Heinrich, Michael Kagan, Samuel Klein, Matthew Leigh, Margarita Osadchy, John Andrew Raine

doi:10.1088/2632-2153/ad64a8

Abstract

We propose masked particle modeling (MPM) as a self-supervised method for learning generic, transferable, and reusable representations on unordered sets of inputs, for use with high energy physics (HEP) scientific data. This work provides a novel scheme for masked-modeling-based pre-training that learns permutation-invariant functions on sets. More generally, it is a step towards building large foundation models for HEP that can be generically pre-trained with self-supervised learning and later fine-tuned for a variety of downstream tasks. In MPM, particles in a set are masked and the training objective is to recover their identity, as defined by a discretized token representation from a pre-trained vector quantized variational autoencoder. We study the efficacy of the method on samples of high energy jets at collider physics experiments, including studies of the impact of discretization, permutation invariance, and ordering. We also study the fine-tuning capability of the model, showing that it can be adapted to tasks such as supervised and weakly supervised jet classification, and that the model can transfer efficiently, with small fine-tuning data sets, to new classes and new data domains.
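The masking objective described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation. It assumes a toy jet of 8 constituents with 3 features each, a random array standing in for the pre-trained VQ-VAE codebook, nearest-neighbour lookup on raw features in place of a learned encoder, zeroed features in place of a learned mask embedding, and random logits in place of a backbone network. All function and variable names are hypothetical.

```python
import numpy as np


def tokenize(particles, codebook):
    """Assign each particle the index of its nearest codebook vector.

    Simplification: a real VQ-VAE would first pass particles through an encoder.
    """
    # Squared distances between every particle (n, d) and every code (k, d).
    d2 = ((particles[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    return d2.argmin(axis=1)


def mask_particles(particles, mask_fraction, rng):
    """Mask a random subset of particles; return the masked copy and a boolean mask."""
    n = particles.shape[0]
    n_mask = max(1, int(round(mask_fraction * n)))
    idx = rng.choice(n, size=n_mask, replace=False)
    mask = np.zeros(n, dtype=bool)
    mask[idx] = True
    masked = particles.copy()
    masked[mask] = 0.0  # assumption: zeros stand in for a learned mask embedding
    return masked, mask


def masked_cross_entropy(logits, targets, mask):
    """Mean cross-entropy between predicted and true tokens, over masked positions only."""
    z = logits[mask]
    z = z - z.max(axis=1, keepdims=True)  # shift for numerical stability
    log_probs = z - np.log(np.exp(z).sum(axis=1, keepdims=True))
    return -log_probs[np.arange(z.shape[0]), targets[mask]].mean()


rng = np.random.default_rng(0)
particles = rng.normal(size=(8, 3))   # toy jet (assumed shape)
codebook = rng.normal(size=(16, 3))   # stand-in for a pre-trained codebook
tokens = tokenize(particles, codebook)
masked, mask = mask_particles(particles, 0.3, rng)
logits = rng.normal(size=(8, 16))     # stand-in for the backbone's per-particle output
loss = masked_cross_entropy(logits, tokens, mask)
```

In an actual training loop the logits would come from a permutation-equivariant network applied to `masked`, and `loss` would be minimized by gradient descent; the sketch only shows how targets and the masked loss are formed.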


Journal: Machine Learning: Science and Technology
Publication Date: Sep 1, 2024
Citations: 1
License type: CC BY


