Abstract

In high-energy physics, the event is the basic unit of data: it records a particle collision or an interaction among particles. Modern experimental facilities can produce event data at the petabyte (PB) scale, and current file-based storage systems have not kept pace with this volume. Event data are mostly accessed at random, yet locating a few specific events in large files is inefficient. This paper therefore proposes an event-oriented data storage technique that caches frequently accessed data in HBase. Event data in ROOT files are serialized and dumped to intermediate files, and these intermediate files are then loaded into HBase in bulk (bulkload) using cluster resources. Finally, we conduct a data access experiment using data from the Beijing Spectrometer (BESIII) experiment. The results show that the new storage scheme improves random data access performance by more than fourfold.
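
The event-oriented idea can be illustrated with a minimal sketch. It is not the paper's implementation: the row-key layout (a run number followed by an event number), the function names, and the in-memory dictionary standing in for a key-value store are all assumptions made for illustration, and no HBase API is called. The sketch relies only on the fact that HBase orders rows by the bytes of the row key, so a fixed-width big-endian key keeps byte order consistent with numeric order, and records prepared for a bulk load are sorted by key.

```python
import struct
from typing import Dict, Iterable, List, Optional, Tuple

Record = Tuple[bytes, bytes]  # (row key, serialized event payload)


def make_row_key(run_id: int, event_id: int) -> bytes:
    """Hypothetical row key: 4-byte run number + 8-byte event number.

    Big-endian packing makes byte-wise ordering match numeric ordering.
    """
    return struct.pack(">IQ", run_id, event_id)


def to_sorted_records(events: Iterable[Tuple[int, int, bytes]]) -> List[Record]:
    """Turn (run_id, event_id, payload) triples into key-sorted records,
    mimicking the sorted intermediate data a bulk load would consume."""
    records = [(make_row_key(run, evt), payload) for run, evt, payload in events]
    records.sort(key=lambda kv: kv[0])
    return records


def scan_lookup(records: List[Record], key: bytes) -> Optional[bytes]:
    """File-style access: walk the records until the event is found."""
    for k, v in records:
        if k == key:
            return v
    return None


def build_store(records: List[Record]) -> Dict[bytes, bytes]:
    """Stand-in for a key-value store: direct lookup by row key."""
    return dict(records)


if __name__ == "__main__":
    # Placeholder payloads; real payloads would be serialized event objects.
    events = [(2, 1, b"c"), (1, 10, b"b"), (1, 2, b"a")]
    records = to_sorted_records(events)
    store = build_store(records)
    wanted = make_row_key(1, 10)
    print(scan_lookup(records, wanted), store.get(wanted))
```

Both lookups return the same payload; the difference is that the keyed lookup does not have to read through unrelated events first, which is the access pattern the abstract targets.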
