Abstract

ATLAS is one of the general purpose experiments observing hadron collisions at the LHC at CERN. Its trigger and data acquisition system (TDAQ) is responsible for selecting and transporting interesting physics events from the detector to permanent storage where the data are used for further processing. The transient storage of ATLAS TDAQ is the last component of the online system in the data flow. It records selected events at several GB/s to non-volatile storage before transfer to offline permanent storage. The transient storage is a distributed system consisting of high-performance direct-attached storage servers accounting for 480 hard drives. A distributed multi-threaded C++ application operates the hardware. The transient storage is also responsible for computing a checksum for the data, which is used to ensure data integrity of the transferred data. Reliability and efficiency of this system are critical for the operations of TDAQ as well. This paper presents the existing multi-threading strategy of the software and how the available hardware resources are used. We then introduce how multi-threaded checksum computation was introduced to increase significantly the maximum throughput of the system. We discuss the key concepts of the implementation with a focus on the importance of overhead minimization. Finally the paper reports on the tests done on the production system to demonstrate the validity of the implementation and measurements of the performance improvement in the view of future LHC and ATLAS upgrades.

Highlights

  • ATLAS [1] is one of the general purpose experiments observing hadron collisions at the LHC [2] at CERN

  • Its trigger and data acquisition system (TDAQ) [3] is responsible for selecting and transporting interesting physics events from the detector to permanent storage where the data are used for further processing

  • TDAQ is a complex system consisting of thousands of computers interconnected by several networks

Read more

Summary

Introduction

ATLAS [1] is one of the general purpose experiments observing hadron collisions at the LHC [2] at CERN. As it is part of the online TDAQ system, the performance of the transient storage system has a direct impact on the data-taking capabilities of ATLAS.

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call