Design, Results, Evolution and Status of the ATLAS Simulation at Point1 Project

S Ballestrero,A Sedov,C J Lee,F Brasolin,D A Scannicchio,A Di Girolamo,M S Twomey,F Wang,M E Pozo Astigarraga,A Zaytsev,D Fazio,C Contescu,S M Batraneanu

doi:10.1088/1742-6596/664/2/022008

Abstract

During the LHC Long Shutdown 1 (LSI) period, that started in 2013, the Simulation at Point1 (Sim@P1) project takes advantage, in an opportunistic way, of the TDAQ (Trigger and Data Acquisition) HLT (High-Level Trigger) farm of the ATLAS experiment. This farm provides more than 1300 compute nodes, which are particularly suited for running event generation and Monte Carlo production jobs that are mostly CPU and not I/O bound. It is capable of running up to 2700 Virtual Machines (VMs) each with 8 CPU cores, for a total of up to 22000 parallel jobs. This contribution gives a review of the design, the results, and the evolution of the Sim@P1 project, operating a large scale OpenStack based virtualized platform deployed on top of the ATLAS TDAQ HLT farm computing resources. During LS1, Sim@P1 was one of the most productive ATLAS sites: it delivered more than 33 million CPU-hours and it generated more than 1.1 billion Monte Carlo events. The design aspects are presented: the virtualization platform exploited by Sim@P1 avoids interferences with TDAQ operations and it guarantees the security and the usability of the ATLAS private network. The cloud mechanism allows the separation of the needed support on both infrastructural (hardware, virtualization layer) and logical (Grid site support) levels. This paper focuses on the operational aspects of such a large system during the upcoming LHC Run 2 period: simple, reliable, and efficient tools are needed to quickly switch from Sim@P1 to TDAQ mode and back, to exploit the resources when they are not used for the data acquisition, even for short periods. The evolution of the central OpenStack infrastructure is described, as it was upgraded from Folsom to the Icehouse release, including the scalability issues addressed.

Highlights

Results, Evolution and Status of the ATLAS simulation in Point1 project
The simulation in Point1 project, based on OpenStack, uses in an opportunistic way the resources of the TDAQ High Level Trigger (HLT) farm of the ATLAS experiment
More than 1300 compute nodes (CNs) running up to 2700 VMs are exploited for running event generation and Monte Carlo production jobs, mostly CPU and not I/O bound, for a total of up to 22K parallel running jobs

Summary

Introduction

Results, Evolution and Status of the ATLAS simulation in Point1 project The simulation in Point1 project, based on OpenStack, uses in an opportunistic way the resources of the TDAQ High Level Trigger (HLT) farm of the ATLAS experiment. More than 1300 compute nodes (CNs) running up to 2700 VMs are exploited for running event generation and Monte Carlo production jobs, mostly CPU and not I/O bound, for a total of up to 22K parallel running jobs After the setup phase, during 2014 Sim@P1 was one of the most productive ATLAS sites, the farm was used for other activities: milestone run, technical run, infrastructure maintenance.

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Physics: Conference Series	Publication Date: Dec 1, 2015
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Design, Results, Evolution and Status of the ATLAS Simulation at Point1 Project

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Similar Papers

Sim@P1: Using Cloudscheduler for offline processing on the ATLAS HLT farm
...
EPJ Web of Conferences | VOL. 214
, et. al. ...
01 Jan 2019
EPJ Web of Conferences | VOL. 214

ATLAS Sim@P1 upgrades during long shutdown two
Frank Berghaus ... W Kamleh
EPJ Web of Conferences | VOL. 245
Frank Berghaus, et. al.Frank Berghaus ... W Kamleh
01 Jan 2020
EPJ Web of Conferences | VOL. 245

Infrastructure of the ATLAS Event Filter
Andrea Negri
-
Andrea NegriAndrea Negri
01 Jan 2007
01 Jan 2007

The Associative Memory Serial Link Processor of the ATLAS Fast TracKer Processing System
Calliope-Louisa Sotiropoulou
-
Calliope-Louisa SotiropoulouCalliope-Louisa Sotiropoulou
01 Oct 2017
01 Oct 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Design, Results, Evolution and Status of the ATLAS Simulation at Point1 Project

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series