Abstract

BigPanDA monitoring is a web application that provides various processing and representation of the Production and Distributed Analysis (PanDA) system objects states. Analysing hundreds of millions of computation entities, such as an event or a job, BigPanDA monitoring builds different scales and levels of abstraction reports in real time mode. Provided information allows users to drill down into the reason of a concrete event failure or observe the broad picture such as tracking the computation nucleus and satellites performance or the progress of a whole production campaign. PanDA system was originally developed for the ATLAS experiment. Currently, it manages execution of more than 2 million jobs distributed over 170 computing centers worldwide on daily basis. BigPanDA is its core component commissioned in the middle of 2014 and now is the primary source of information for ATLAS users about the state of their computations and the source of decision support information for shifters, operators and managers. In this work, we describe the evolution of the architecture, current status and plans for the development of the BigPanDA monitoring.

Highlights

  • BigPanDA daily serves more than 17k queries

  • Recommendation system analyses prior visits of bigpanda.cern.ch by a user and suggests views and objects which in the focus of his/her interests

Read more

Summary

Web frontend to the Production and Distributed Analysis objects states

The ATLAS experiment uses PanDA (Production and Data Analysis) Workload Management System for managing the workflow for all data processing on over 150 data centers and opportunistic resources such as commercial clouds, supercomputers, volunteer machines. ● Since 2005 an effective interface to the heterogeneous distributed computing infrastructure ● 300k simultaneous jobs ● 2M jobs a day ● ~1500 users ● Processes over an exabyte of data per year ● Applications beyond ATLAS: COMPASS, AMS, (+ number of evaluations)

Failures investigations
Scales covered by BigPanDA monitoring
Jobs Summary
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call