Running ATLAS workloads within massively parallel distributed applications using Athena Multi-Process framework (AthenaMP)

Paolo Calafiura,Vakhtang Tsulaia,Charles Leggett,Rolf Seuster,Peter Van Gemmeren

doi:10.1088/1742-6596/664/7/072050

Paolo Calafiura, Vakhtang Tsulaia + Show 3 more

Open Access

https://doi.org/10.1088/1742-6596/664/7/072050

Copy DOI

Abstract

AthenaMP is a multi-process version of the ATLAS reconstruction, simulation and data analysis framework Athena. By leveraging Linux fork and copy-on-write mechanisms, it allows for sharing of memory pages between event processors running on the same compute node with little to no change in the application code. Originally targeted to optimize the memory footprint of reconstruction jobs, AthenaMP has demonstrated that it can reduce the memory usage of certain configurations of ATLAS production jobs by a factor of 2. AthenaMP has also evolved to become the parallel event-processing core of the recently developed ATLAS infrastructure for fine-grained event processing (Event Service) which allows the running of AthenaMP inside massively parallel distributed applications on hundreds of compute nodes simultaneously. We present the architecture of AthenaMP, various strategies implemented by AthenaMP for scheduling workload to worker processes (for example: Shared Event Queue and Shared Distributor of Event Tokens) and the usage of AthenaMP in the diversity of ATLAS event processing workloads on various computing resources: Grid, opportunistic resources and HPC.

Highlights

AthenaMP leverages Linux Copy-On-Write for sharing memory pages between processes, which were forked from the same master process
The plot shows the number of CPU-cores used by ATLAS production jobs – serial and MP – on the Grid during one week in February-March 2015
For the Event Service AthenaMP uses the strategy of distributing event tokens to the worker processes

Summary

History of AthenaMP

✔ ATLAS reconstruction is memory-hungry ✔ We needed to have a mechanism for optimizing memory footprint without touching the algorithmic code-base. AthenaMP leverages Linux Copy-On-Write for sharing memory pages between processes, which were forked from the same master process. ✔ Presented at CHEP 2009: “Harnessing multicores: strategies and implementations in ATLAS”, S.Binet et al. Being actively used for running ATLAS production jobs on multi-core resources on the Grid. The master process goes through the initialization phase and forks N sub-processes (workers)

Schematic View of ATLAS AthenaMP

Assigning workloads to the worker processes

Number of CPU cores used by ATLAS production jobs

AthenaMP and the ATLAS Event Service

AthenaMP and Yoda

Future developments

Summary

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Physics: Conference Series	Publication Date: Dec 1, 2015
Citations: 15	License type: cc-by

R Discovery Prime

R Discovery Prime

Running ATLAS workloads within massively parallel distributed applications using Athena Multi-Process framework (AthenaMP)

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Similar Papers

Accelerating Complex Event Processing through GPUs
Prabodha Srimal Rodrigo ... H M N Dilum Bandara
-
Prabodha Srimal Rodrigo, et. al.Prabodha Srimal Rodrigo ... H M N Dilum Bandara
01 Dec 2015
01 Dec 2015

Mythbusters
Tim Bass
-
Tim BassTim Bass
20 Jun 2007
20 Jun 2007

Raythena: a vertically integrated scheduler for ATLAS applications on heterogeneous distributed resources
Miha Muškinja ... L Silvestris
EPJ Web of Conferences | VOL. 245
Miha Muškinja, et. al.Miha Muškinja ... L Silvestris
01 Jan 2020
EPJ Web of Conferences | VOL. 245

The Semantic Complex Event Processing Based on Metagraph Approach
Yuriy E Gapanyuk
-
Yuriy E GapanyukYuriy E Gapanyuk
17 Jul 2019
17 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Running ATLAS workloads within massively parallel distributed applications using Athena Multi-Process framework (AthenaMP)

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series