Implementation of ATLAS Distributed Computing monitoring dashboards using InfluxDB and Grafana

Thomas Beermann, Michal Svatos, Aleksandr Alekseev, Sabine Crépé-Renaudin, Petr Vokac, Helmut Wolters, Ivan Glushkov, Armen Vartapetian, Dario Barberis, Johannes Elmsheuser

doi:10.1051/epjconf/202024503031

Abstract

For the last 10 years, the ATLAS Distributed Computing project has based its monitoring infrastructure on a set of custom-designed dashboards provided by CERN. This system functioned very well for LHC Runs 1 and 2, but its maintenance has become progressively more difficult, and the conditions for Run 3, starting in 2021, will be even more demanding; hence a more standard code base and more automatic operations are needed. A new infrastructure has been provided by CERN, based on InfluxDB as the data store and Grafana as the display environment. ATLAS has adapted and further developed its monitoring tools to use this infrastructure for data and workflow management monitoring and accounting dashboards, expanding the range of previous possibilities with the aim of achieving a single, simpler environment for all monitoring applications. This document describes these tools and the data flows for monitoring and accounting.

Highlights

ATLAS [1] Distributed Computing (ADC) uses two core systems to run jobs on the grid and manage the data: the workflow management system PanDA [2] and the distributed data management system Rucio [3].
During LHC Run 1 and Run 2, the monitoring and accounting systems were based on custom frameworks developed by CERN IT and ADC, which had been in use for 10 years.
The data collection, processing and display presented in this document are based on the Unified Monitoring Infrastructure (UMA).

Summary

Introduction

ATLAS [1] Distributed Computing (ADC) uses two core systems to run jobs on the grid and manage the data: the workflow management system PanDA [2] and the distributed data management system Rucio [3]. During LHC Run 1 and Run 2, the monitoring and accounting systems were based on custom frameworks developed by CERN IT and ADC, which had been in use for 10 years. These systems served well during that time, but they started to show their age in several areas. The original developers had long since left, taking with them much of the in-depth knowledge necessary to further optimise the system and the ability to quickly add new features to the monitoring. In 2016 the CERN MonIT group started to build a new Unified Monitoring Infrastructure (UMA) based on open source technology [4]. The data collection, processing and display presented in this document are based on UMA.

The CERN MonIT Infrastructure

Processing

Backends

Dashboards

Transfer Dashboard

Site Accounting


Journal: EPJ Web of Conferences
Publication Date: Jan 1, 2020
Citations: 7
License type: CC BY 4.0
