Abstract

The ATLAS Distributed Computing (ADC) project delivers production tools and services for ATLAS offline activities such as data placement and data processing on the Grid. The system has been capable of sustaining with high efficiency the needed computing activities during the first run of LHC data taking, and has demonstrated flexibility in reacting promptly to new challenges. Databases are a vital part of the whole ADC system. The Oracle Relational Database Management System (RDBMS) has been addressing a majority of the ADC database requirements for many years. Much expertise was gained through the years and without a doubt will be used as a good foundation for the next generation PanDA (Production ANd Distributed Analysis) and DDM (Distributed Data Management) systems. In this paper we present the current production ADC database solutions and notably the planned changes on the PanDA system, and the next generation ATLAS DDM system called Rucio. Significant work was performed on studying different solutions to arrive at the best relational and physical database model for performance and scalability in order to be ready for deployment and operation in 2014.

Highlights

  • ATLAS databases topology Main ADC (ATLAS Distributed Computing) systems hosted on the ATLAS Distributed Computing Oracle Real Application Cluster (ADCR)

  • In Production and Distributed Analysis (PanDA) and other ATLAS applications interval partitioning is very handy for transient type of data where we impose a policy of agreed data sliding window

  • PanDA monitor can benefit from the Active Data Guard (ADG) resources => An option is to sustain two connection pools: one to the primary database ADCR

Read more

Summary

Next generation database relational solutions for ATLAS Distributed Computing

ATLAS databases topology Main ADC (ATLAS Distributed Computing) systems hosted on the ATLAS Distributed Computing Oracle Real Application Cluster (ADCR). DB volumes, rates and challenges Data segments organization, technical solutions and specific DB performance and tuning techniques Development of new large scale applications Conclusions

Main applications hosted on the ADCR DB
ATLAS databases topology
PanDA system
Advantages of the PanDA data segmentation
Additional DB techniques used in PanDA
Potential use of ADG from PanDA monitor
Rucio system
Rucio development
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call