Abstract

The Belle II experiment, a major upgrade of the previouse+e−asymmetric collider experiment Belle, is expected to produce tens of petabytes of data per year due to the luminosity increase from the upgraded SuperKEKB accelerator. The distributed computing system of the Belle II experiment plays a key role, storing and distributing data in a reliable way to be easily accessed and analyzed by more than 1000 collaborators. In particular, the Belle II Raw Data Management system has been developed with an aim to upload output files onto grid storage, register them into the file and metadata catalogs, and make two replicas of the full raw data set using the Belle II Distributed Data Management system. It has been implemented as an extension of DIRAC (Distributed Infrastructure with Remote Agent Control) and consists of a database, services, client and monitoring tools, and several agents that treat the data automatically. The first year of data taken with the Belle II full detector has been managed by the Belle II Raw Data Management system successfully. The design, current status, and performance are presented. Prospects for improvements towards the full luminosity data taking are also reviewed.

Highlights

  • The SuperKEKB accelerator and the Belle II detector are major upgrades of the KEKB accelerator and Belle detector, located at the KEK laboratory in Tsukuba, Japan [1]

  • The grid system of the Belle II experiment consists of 58 computing sites and 21 storage elements around the world, centrally managed with DIRAC

  • The Belle II Raw Data Management system is the main component of BelleRawDIRAC, another extension of DIRAC on top of BelleDIRAC, dedicated to registration and replication of raw data files

Read more

Summary

Introduction

The SuperKEKB accelerator and the Belle II detector are major upgrades of the KEKB accelerator and Belle detector, located at the KEK laboratory in Tsukuba, Japan [1]. Electrons and positrons collide with a high rate, with an expectation to accumulate a data sample of 50 ab−1, approximately 50 times more than its predecessor With this projected luminosity, Belle II is expected to produce tens of petabytes of real and simulated data per year. For this purpose, the distributed computing group of Belle II has designed a customized system to perform the registration and replication of raw data files efficiently

Raw Data Registration and Replication
Monitoring
Raw Data Processing during the Early Operation of Belle II
Monitoring System
Findings
Conclusions and Perspectives
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call