Abstract

The ALICE experiment will undergo extensive hardware and software upgrades for the LHC Run3. This translates in significant increase of the CPU and storage resources required for data processing, and at the same time the data access rates will grow linearly with the amount of resources. JAliEn (Java ALICE Environment) is the new Grid middleware designed to scale-out horizontally to fulfil the computing needs of the upgrade, and at the same time to modernize all parts of the distributed system software. This paper will present the architecture of the JAliEn framework, the technologies used and performance measurements. This work will also describe the next generation solution that will replace our main database backend, the AliEn File Catalogue. The catalogue is an integral part of the system, containing the metadata of all files written to the distributed Grid storage and also provides powerful search and data manipulation tools. As for JAliEn, the focus has been put onto horizontal scalability, with the aim to handle near exascale data volumes and order of magnitude more workload than the currently used Grid middleware. Lastly, this contribution will present how JAliEn manages the increased complexity of the tasks associated with the new ALICE data processing and analysis framework (ALFA) and multi-core environments.

Highlights

  • The ALICE [1] experiment at CERN is one of the four big particle detectors at the Large Hadron Collider (LHC)

  • Java ALICE Environment (JAliEn) [3] is the new middleware framework used on top of Worldwide LHC Computing Grid (WLCG) within ALICE

  • Different architectural and technological changes have been introduced to JAliEn to make it capable of fulfilling the ALICE requirements in Run3 and beyond, and some will be described by this work

Read more

Summary

Introduction

The ALICE [1] experiment at CERN is one of the four big particle detectors at the Large Hadron Collider (LHC). From the initial Grid capacity to today, the deployed computing resources have grown continuously and in the LHC data taking years (2010 to 2018) the CPU and storage capacity have increased by a factor 10. The ALICE Run 3 upgrade will translate yet into more resource increases associated to the O2 facility, which is a purpose-built large computational cluster to be used for synchronous and asynchronous data processing. It will comprise of 60 petabytes of disk and around 100000 CPU cores. For improved scalability and maintainability, the ALICE high-level Grid services and clients are being reimplemented as the new JAliEn framework

Java ALICE Environment framework
Authentication and authorization model
Client-server
Data access
Job pilot
Isolation of the payload using containers
JAliEn File Catalogue
Performance and size goals
Apache Cassandra and ScyllaDB internals and tuning
Benchmark results
Lessons learnt
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.