Abstract

The ATLAS experiment is scaling up Big Data processing for the next LHC run using a multilevel workflow system. In Big Data processing, ATLAS deals with datasets, not individual files. Similarly, a task (comprising many jobs) has become the unit of the ATLAS workflow in distributed computing, with about 0.8M tasks processed per year. To manage the diversity of LHC physics (exceeding 35K physics samples per year), the individual data processing tasks are organized into workflows. For example, the Monte Carlo workflow is composed of many steps: generate or configure hard processes, hadronize signal and minimum-bias (pileup) events, simulate energy deposition in the ATLAS detector, digitize the electronics response, simulate triggers, reconstruct data, convert the reconstructed data into ROOT ntuples for physics analysis, etc. Outputs are merged and/or filtered as necessary to optimize the chain. The bi-level workflow manager, ProdSys2, generates the actual workflow tasks, whose jobs are executed across more than a hundred distributed computing sites by PanDA, the ATLAS job-level workload management system. On the outer level, the Database Engine for Tasks (DEfT) empowers production managers with templated workflow definitions. On the inner level, the Job Execution and Definition Interface (JEDI) is integrated with PanDA to provide dynamic job definition tailored to the sites' capabilities. We report on scaling up the production system to accommodate a growing number of requirements from the main ATLAS areas: Trigger, Physics, and Data Preparation.
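To make the idea of templated, multi-step workflow definitions concrete, the sketch below models the Monte Carlo chain from the abstract as a Python template that is expanded into a chain of tasks, each consuming the dataset produced by the previous step. This is a minimal illustrative sketch, not the actual DEfT/ProdSys2 schema: the WorkflowStep class, the transform names, the sample tag, and the dataset naming are all hypothetical assumptions.

from dataclasses import dataclass

@dataclass
class WorkflowStep:
    name: str                    # human-readable step name, e.g. "simulate"
    transform: str               # job transformation to be executed (hypothetical names)
    input_format: str            # dataset format consumed by this step
    output_format: str           # dataset format produced by this step
    merge_output: bool = False   # merge small outputs to optimize the chain

# A Monte Carlo workflow template mirroring the steps in the abstract:
# generation -> simulation -> digitization + trigger -> reconstruction -> ntuples.
MC_TEMPLATE = [
    WorkflowStep("generate",    "Generate_tf", "config", "EVNT"),
    WorkflowStep("simulate",    "Sim_tf",      "EVNT",   "HITS"),
    WorkflowStep("digi+trig",   "Digi_tf",     "HITS",   "RDO", merge_output=True),
    WorkflowStep("reconstruct", "Reco_tf",     "RDO",    "AOD"),
    WorkflowStep("ntuple",      "NTUP_tf",     "AOD",    "NTUP", merge_output=True),
]

def instantiate(template, sample_tag):
    """Expand a workflow template into concrete task definitions,
    chaining each step's output dataset into the next step's input."""
    tasks, input_ds = [], f"{sample_tag}.config"
    for step in template:
        output_ds = f"{sample_tag}.{step.output_format}"
        tasks.append({"step": step.name, "transform": step.transform,
                      "input": input_ds, "output": output_ds,
                      "merge": step.merge_output})
        input_ds = output_ds  # the chain: this output feeds the next step
    return tasks

# Hypothetical sample tag, loosely styled after ATLAS dataset names.
for task in instantiate(MC_TEMPLATE, "mc15_13TeV.410000"):
    print(task)

One template can thus be instantiated for each of the tens of thousands of physics samples per year, which is the point of pushing the workflow logic into reusable definitions rather than hand-written task lists.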

Highlights

  • The multi-purpose nature of the ATLAS experiment [1] at the LHC has resulted in continuous growth in use cases for Big Data processing as more data and new requirements emerge.

  • To prepare the production system for the challenges of Run 2, PanDA has been upgraded with the Job Execution and Definition Interface (JEDI) [5], the production system has been enhanced with the Database Engine for Tasks (DEfT) [7], and the Distributed Data Management (DDM) system is being upgraded to Rucio [8] (a schematic sketch of JEDI-style site-tailored job definition follows below).
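The following is a minimal sketch of the idea behind JEDI's dynamic job definition: rather than fixing the number of events per job task-wide, the job size is derived from what each site can handle. The site model, field names, and numbers here are illustrative assumptions, not the real JEDI API.

def define_jobs(total_events, site):
    """Split a task's input into jobs sized to the target site.

    The events-per-job count is bounded by the site's walltime limit
    (given an assumed per-event processing cost) and a memory-driven cap.
    """
    per_job = max(1, min(
        site["walltime_limit_s"] // site["seconds_per_event"],
        site["max_events_per_job"],
    ))
    jobs = []
    for first in range(0, total_events, per_job):
        jobs.append({"site": site["name"],
                     "first_event": first,
                     "n_events": min(per_job, total_events - first)})
    return jobs

# A fast site gets fewer, larger jobs; a constrained site gets more, smaller ones.
fast = {"name": "SITE_A", "walltime_limit_s": 86400,
        "seconds_per_event": 60, "max_events_per_job": 2000}
slow = {"name": "SITE_B", "walltime_limit_s": 43200,
        "seconds_per_event": 180, "max_events_per_job": 2000}

print(len(define_jobs(10000, fast)))  # 7 jobs of up to 1440 events each
print(len(define_jobs(10000, slow)))  # 42 jobs of up to 240 events each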

Summary

Multilevel Workflow System in the ATLAS Experiment

M Borodin, K De, J Garcia Navarro, D Golubkov, A Klimentov, T Maeno and A Vaniachine on behalf of the ATLAS Collaboration

1 Department of Elementary Particle Physics, National Research Nuclear University "MEPhI," Moscow, 117513, Russia
2 Physics Department, University of Texas Arlington, Arlington, TX 76019, United States of America
3 Instituto de Fisica Corpuscular, Universidad de Valencia, E-46980 Paterna, Spain
4 Experimental Physics Department, Institute for High Energy Physics, Protvino, 142281, Russia
5 Big Data Laboratory, National Research Centre "Kurchatov Institute," Moscow, 123182, Russia
6 Physics Department, Brookhaven National Laboratory, Upton, NY 11973, United States of America
7 High Energy Physics Division, Argonne National Laboratory, 9700 South Cass Avenue, Argonne, IL 60439, United States of America

Introduction
Group production
Reprocessing campaign
Findings
Conclusions