Abstract

AsyncStageOut (ASO) is the component of the CMS distributed data analysis system (CRAB) that manages users transfers in a centrally controlled way using the File Transfer System (FTS3) at CERN. It addresses a major weakness of the previous, decentralized model, namely that the transfer of the user’s output data to a single remote site was part of the job execution, resulting in inefficient use of job slots and an unacceptable failure rate.Currently ASO manages up to 600k files of various sizes per day from more than 500 users per month, spread over more than 100 sites. ASO uses a NoSQL database (CouchDB) as internal bookkeeping and as way to communicate with other CRAB components. Since ASO/CRAB were put in production in 2014, the number of transfers constantly increased up to a point where the pressure to the central CouchDB instance became critical, creating new challenges for the system scalability, performance, and monitoring. This forced a re-engineering of the ASO application to increase its scalability and lowering its operational effort.In this contribution we present a comparison of the performance of the current NoSQL implementation and a new SQL implementation, and how their different strengths and features influenced the design choices and operational experience. We also discuss other architectural changes introduced in the system to handle the increasing load and latency in delivering output to the user.

Highlights

  • The Compact Muon Solenoid (CMS) experiment at CERN is one of the four major experiments of the Large Hadron Collider (LHC) [1]

  • This is expected because ASO cannot keep up with processing and CouchDB cannot keep up with indexing the large number of new documents

  • In this paper we have presented the improved performance of the ASO component of CRAB achieved by replacing its CouchDB NoSQL database by an Oracle relational database

Read more

Summary

Related content

- FTS3: Quantitative Monitoring H Riahi, M Salichos, O Keeble et al. - Resilient FTS3 service at GridKa T. - FTS3: Quantitative Monitoring H Riahi, M Salichos, O Keeble et al. - Resilient FTS3 service at GridKa T. - CMS users data management service integration and first experiences with its NoSQL data storage H Riahi, D Spiga, T Boccali et al. View the article online for updates and enhancements. D Ciangottini, J Balcas, M Mascheroni, E A Rupeika, E Vaandering, H Riahi, J M D Silva, J M Hernandez, S Belforte, T T Ivanov

Introduction
Published under licence by IOP Publishing Ltd
Findings
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.