Abstract

In LHC Run 3 the ALICE Collaboration will have to cope with an increase of lead-lead collision data of two orders of magnitude compared to the Run 1 and 2 data-taking periods. The Online-Offline (O2) software framework has been developed to allow for distributed and efficient processing of this unprecedented amount of data. Its design, which is based on a message-passing back end, required the development of a dedicated Analysis Framework that uses the columnar data format provided by Apache Arrow. The O2 Analysis Framework provides a user-friendly high-level interface and hides the complexity of the underlying distributed framework. It allows users to access and manipulate the data in the new format both in the traditional “event loop” approach and in a declarative approach using bulk processing operations based on Arrow’s Gandiva sub-project. Building on the well-tested system of analysis trains developed by ALICE in Run 1 and 2, the AliHyperloop infrastructure is being developed. It provides a fast and intuitive user interface for running demanding analysis workflows in the GRID environment and on the dedicated Analysis Facility. In this document, we report on the current state and ongoing developments of the Analysis Framework and of AliHyperloop, highlighting the design choices and the benefits of the new system.

Highlights

  • ALICE in Run 3 will run in so-called continuous data-taking mode, with the unit of information being a snapshot of data in a 10 ms-long time window, dubbed timeframe

  • In order to fully exploit the potential of the Data Processing Layer (DPL) for physics analyses, additional developments were required, resulting in the creation of the Analysis Framework as an extension of the DPL

  • The system, called AliHyperloop, can benchmark each analysis in terms of functionality and resource consumption and compose trains that are later submitted to the GRID or to a dedicated Analysis Facility - a specialized Grid site with CPU and disk resources tailored to analysis needs and a small fraction of the data pre-staged locally


Summary

Core design

The analysis data model in Run 3 is a collection of flat tables, arranged in a relational database-like structure connected by index columns. The individual data processors, known as tasks for similarity with the previous analysis framework, are created by the end users as C++ structures with conventionally defined callbacks and declarations of inputs and outputs. By relying on pre-defined indices, a task can access the parts of a grouped table (for example the track information table) that correspond to certain rows in the grouping table (for example the collisions table). This is possible for any pair of tables related by an index, provided the index column of the grouped table is sorted. When deploying a workflow, the web-based analysis tools automatically add service tasks at certain stages that provide common information such as particle identification (PID) decisions or track selection flags.

Bulk operations and declarative analysis features

Future developments and conclusions
