Abstract

The Cherenkov Telescope Array (CTA) is the next-generation instrument in the field of very high energy gamma-ray astronomy. It will be composed of two arrays of Imaging Atmospheric Cherenkov Telescopes, located at La Palma (Spain) and Paranal (Chile). The construction of CTA has just started with the installation of the first telescope on site at La Palma and the first data expected by the end of 2018. The scientific operations should begin in 2022 for a duration of about 30 years. The overall amount of data produced during these operations is around 27 PB per year. The associated computing power for data processing and Monte Carlo (MC) simulations is of the order of hundreds of millions of CPU HS06 hours per year. In order to cope with these high computing requirements, we have developed a production system prototype based on the DIRAC framework, that we have intensively exploited during the past 6 years to handle massive MC simulations on the grid for the CTA design and prototyping phases. CTA workflows are composed of several inter-dependent steps, which we used to handle separately within our production system. In order to fully automatize the whole workflows execution, we have partially revised the production system by further enhancing the data-driven behavior and by extending the use of meta-data to link together the different steps of a workflow. In this contribution we present the application of the production system to the last years MC campaigns as well as the recent production system evolution, intended to obtain a fully data-driven and automatized workflow execution for efficient processing of real telescope data.

Highlights

  • In order to handle massive Monte Carlo productions on the EGI (European Grid Initiative [1]) grid for the Cherenkov Telescope Array (CTA) Consortium [2], we have developed a production setup based on the DIRAC framework [3][4] (CTA–DIRAC setup)

  • In order to cope with these high computing requirements, we have developed a production system prototype based on the DIRAC framework, that we have intensively exploited during the past 6 years to handle massive Monte Carlo (MC) simulations on the grid for the CTA design and prototyping phases

  • In order to fully automatize the whole workflows execution, we have partially revised the production system by further enhancing the data-driven behavior and by extending the use of meta-data to link together the different steps of a workflow. In this contribution we present the application of the production system to the last years MC campaigns as well as the recent production system evolution, intended to obtain a fully data-driven and automatized workflow execution for efficient processing of real telescope data

Read more

Summary

Introduction

In order to handle massive Monte Carlo productions on the EGI (European Grid Initiative [1]) grid for the CTA Consortium [2], we have developed a production setup based on the DIRAC framework [3][4] (CTA–DIRAC setup). For the software deployment and access by grid jobs we rely on the CVMFS [5] system with a Stratum-0 server hosted at CC–IN2P3 and two Stratum-1 servers at CC–IN2P3 and DESY. From the point of view of the accessed computing resources, we have integrated for the first time some cloud resources in the CTA–DIRAC setup.

Cloud resources integration
Monte Carlo productions
Production management
General purpose production system
Architecture
Evolution
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.