Abstract

The Cherenkov Telescope Array (CTA) is the next-generation instrument in the very-high energy gamma ray astronomy domain. It will consist of tens of Cherenkov telescopes deployed in 2 arrays at La Palma (Spain) and Paranal (ESO, Chile) respectively. Currently under construction, CTA will start operations around 2023 for a duration of about 30 years. During operations CTA is expected to produce about 2 PB of raw data per year plus 5-20 PB of Monte Carlo data. The global data volume to be managed by the CTA archive, including all versions and copies, is of the order of 100 PB with a smooth growing profile. The associated processing needs are also very high, of the order of hundreds of millions of CPU HS06 hours per year. In order to optimize the instrument design and study its performances, during the preparatory phase (2010-2017) and the current construction phase, the CTA consortium has run massive Monte Carlo productions on the EGI grid infrastructure. In order to handle these productions and the future data processing, we have developed a production system based on the DIRAC framework. The current system is the result of several years of hardware infrastructure upgrades, software development and integration of different services like CVMFS and FTS. In this paper we present the current status of the CTA production system and its exploitation during the latest large-scale Monte Carlo campaigns.

Highlights

  • The Cherenkov Telescope Array (CTA) production system is in charge of handling the future data processing and Monte Carlo (MC) simulations of the CTA observatory [1]

  • The prototype presented in this paper is based on the DIRAC framework [2] [3] and has been used since several years to handle the massive Monte Carlo simulations for the CTA consortium on the EGI grid [4] [5]

  • The first large scale application of the Production System was for the handling of the recent MC production (‘prod5’ ) that we have performed for the CTA consortium in 2020-2021

Read more

Summary

Introduction

The CTA production system is in charge of handling the future data processing and Monte Carlo (MC) simulations of the CTA observatory [1]. The prototype presented in this paper is based on the DIRAC framework [2] [3] and has been used since several years to handle the massive Monte Carlo simulations for the CTA consortium on the EGI grid [4] [5]. We have contributed to DIRAC core software by developing a new component to further automatize the workflow management. This component, called Production System in the DIRAC jargon (not to be confused with the CTA production system mentioned above), is described in detail in [6] and has been integrated in one of the major releases in 2020. We will present our conclusions on the developed prototype and our plans to update the system for the next-coming CTA operations

CTA-DIRAC infrastructure
Computing Model
Workflow Management
Data characterization
Transformation and Production Systems
Application to CTA workflows
Data Management
Conclusions and perspectives
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.