The ATLAS Production System Evolution: New Data Processing and Analysis Paradigm for the LHC Run2 and High-Luminosity

F H Barreiro,D Golubkov,K De,T Maeno,R Mashinistov,S Padolski,M Borodin,T Wenaus,A Klimentov

doi:10.1088/1742-6596/898/5/052016

Abstract

The second generation of the ATLAS Production System called ProdSys2 is a distributed workload manager that runs daily hundreds of thousands of jobs, from dozens of different ATLAS specific workflows, across more than hundred heterogeneous sites. It achieves high utilization by combining dynamic job definition based on many criteria, such as input and output size, memory requirements and CPU consumption, with manageable scheduling policies and by supporting different kind of computational resources, such as GRID, clouds, supercomputers and volunteer-computers. The system dynamically assigns a group of jobs (task) to a group of geographically distributed computing resources. Dynamic assignment and resources utilization is one of the major features of the system, it didn’t exist in the earliest versions of the production system where Grid resources topology was predefined using national or/and geographical pattern. Production System has a sophisticated job fault-recovery mechanism, which efficiently allows to run multi-Terabyte tasks without human intervention. We have implemented “train” model and open-ended production which allow to submit tasks automatically as soon as new set of data is available and to chain physics groups data processing and analysis with central production by the experiment. We present an overview of the ATLAS Production System and its major components features and architecture: task definition, web user interface and monitoring. We describe the important design decisions and lessons learned from an operational experience during the first year of LHC Run2. We also report the performance of the designed system and how various workflows, such as data (re)processing, Monte-Carlo and physics group production, users analysis, are scheduled and executed within one production system on heterogeneous computing resources.

Highlights

– Designed to meet ATLAS production/analysis requirements for a data-driven workload management system capable of operating at LHC data processing scale
Web UI for Managers and Users provides the interface for task and Tasks%Requests%Layer:%Web%UI% production request managing and monitoring at the higher level
Model is represented by multilevel relational instances:

Summary

Introduction

– Designed to meet ATLAS production/analysis requirements for a data-driven workload management system capable of operating at LHC data processing scale. – Improved resource utilization – New types of computing resources: HPC, Clouds – Improved usability and robustness

ATLAS production system components

Production system data model and workAlows

Work with production request

ProducQon task list

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Physics: Conference Series	Publication Date: Oct 1, 2017
Citations: 9	License type: cc-by

R Discovery Prime

R Discovery Prime

The ATLAS Production System Evolution: New Data Processing and Analysis Paradigm for the LHC Run2 and High-Luminosity

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Similar Papers

Task Management in the New ATLAS Production System
K De ... D Golubkov
Journal of Physics: Conference Series | VOL. 513
K De, et. al.K De ... D Golubkov
11 Jun 2014
Journal of Physics: Conference Series | VOL. 513

Big Data Processing at Microsoft
Hiren Patel ... Clemens Szyperski
-
Hiren Patel, et. al.Hiren Patel ... Clemens Szyperski
20 Nov 2019
20 Nov 2019

How to keep the Grid full and working with ATLAS production and physics jobs
A Pacheco Pagés ... I Glushkov
Journal of Physics: Conference Series | VOL. 898
A Pacheco Pagés, et. al.A Pacheco Pagés ... I Glushkov
01 Oct 2017
Journal of Physics: Conference Series | VOL. 898

Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline
Joseph Slagel ... Robert L Moritz
Molecular & cellular proteomics : MCP | VOL. 14
Joseph Slagel, et. al.Joseph Slagel ... Robert L Moritz
01 Feb 2015
Molecular & cellular proteomics : MCP | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The ATLAS Production System Evolution: New Data Processing and Analysis Paradigm for the LHC Run2 and High-Luminosity

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series