Abstract

Abstract High Energy Physics (HEP) Experiments produce large amounts of data. The data produced in these experiments are in the range of terabytes and petabytes. The explosion of data has posed a challenge in data capture, storage, data integrity, searching, querying, visualization and analysis. This has led to the development of domain-specific file formats like FITS, HDF5, analysis frameworks like ROOT, storage architectures like relational and NoSQL databases, and parallel and distributed data handling methodologies. In this paper, we investigate the read–write performance by comparing the HEP domain-specific framework ROOT and a NoSQL database Berkeley DB in the context of a gamma-ray Cerenkov experiment to meet the requirement of real-time data analysis. Major Atmospheric Cerenkov Experiment (MACE) is a 21 m gamma-ray telescope set up by BARC at HANLE, India. It will generate a few hundred gigabytes of data per observational night. Aiming at the real-time analysis of the data we have developed a dynamic reading mechanism by implementing a binary type provider for data retrieval from the Berkeley DB database. Data analysis queries were performed and compared both in ROOT files using ROOT query methods and in Berkeley DB using Language Integrated Queries (LINQ). Finally, a generic framework facilitating the online analysis of the data is proposed in this paper.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call