Abstract

With the advent of increased computational resources and improved algorithms, machine learning-based models are being increasingly applied to complex problems in particle accelerators. However, such data-driven models may provide overly confident predictions with unknown errors and uncertainties. For reliable deployment of machine learning models in high-regret and safety-critical systems such as particle accelerators, estimates of prediction uncertainty are needed along with accurate point predictions. In this investigation, we evaluate Bayesian neural networks (BNN) as an approach that can provide accurate predictions along with reliably quantified uncertainties for particle accelerator problems, and compare their performance with bootstrapped ensembles of neural networks. We select three accelerator setups for this evaluation: a storage ring, a photoinjector, and a linac. The problems span different data volumes and dimensionalities (e.g., scalar predictions as well as image outputs). It is found that BNN provide accurate predictions of the mean along with reliable estimates of predictive uncertainty across the test cases. In this vein, BNN may offer an attractive alternative to deterministic deep learning tools to generate accurate predictions with quantified uncertainties in particle accelerator applications.

Highlights

  • AND MOTIVATIONParticle accelerators represent some of the most complex and data intensive systems in the scientific realm

  • We evaluate the ability of Bayesian neural networks (BNN) to provide accurate predictions of the mean, as well as generate reliable estimates for the predictive uncertainty

  • Addressing the computational cost of utilizing bootstrapped ensembles and BNNs, we find that the CPU times for the BNN is 9930 s and for the bootstrapped ensemble is 12030 s

Read more

Summary

INTRODUCTION

Particle accelerators represent some of the most complex and data intensive systems in the scientific realm. Deterministic neural networks are unable to recognize out-of-sample examples and habitually make erroneous predictions for such cases with high confidence [16,17,18] Such uncertainty in predictions has had grave consequences while applying deep learning to high-regret applications. SLAC is integrating machine learning-based models and modelbased control into operation of its accelerators In this context, neural networks in particular are appealing for high dimensional sets of scalar or beam image inputs and outputs. Deterministic neural networks make predictions with a high degree of theoretical confidence In accelerator applications, such instances can arise due to the nonstationary nature of the relationships to be modeled due to drift, regions of data sparsity due to the high dimension of the parameter spaces, and changes to the machine component configurations to support different experiments.

MATHEMATICAL DETAILS
COMPARISON OF DIFFERENT APPROACHES FOR ACCELERATOR TEST CASES
LCLS-II injector
LCLS electron beamline
OOD ROBUSTNESS
EXTENSION TO PREDICTION OF BEAM PHASE SPACE IMAGES
SUMMARY & CONCLUSIONS
Mathematical formalism of Bayesian neural networks
Findings
Mathematical formalism for Bootstrapped ensembles
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call