The Belle II Raw Data Management System

Michel Hernández Villanueva,Ikuo Ueda,C Doglioni,P Jackson,L Silvestris,W Kamleh,D Kim,G.A Stewart

doi:10.1051/epjconf/202024504005

Michel Hernández Villanueva, Ikuo Ueda + Show 6 more

Open Access

https://doi.org/10.1051/epjconf/202024504005

Copy DOI

Journal: EPJ Web of Conferences	Publication Date: Jan 1, 2020
Citations: 2	License type: CC BY 4.0

Abstract

The Belle II experiment, a major upgrade of the previouse+e−asymmetric collider experiment Belle, is expected to produce tens of petabytes of data per year due to the luminosity increase from the upgraded SuperKEKB accelerator. The distributed computing system of the Belle II experiment plays a key role, storing and distributing data in a reliable way to be easily accessed and analyzed by more than 1000 collaborators. In particular, the Belle II Raw Data Management system has been developed with an aim to upload output files onto grid storage, register them into the file and metadata catalogs, and make two replicas of the full raw data set using the Belle II Distributed Data Management system. It has been implemented as an extension of DIRAC (Distributed Infrastructure with Remote Agent Control) and consists of a database, services, client and monitoring tools, and several agents that treat the data automatically. The first year of data taken with the Belle II full detector has been managed by the Belle II Raw Data Management system successfully. The design, current status, and performance are presented. Prospects for improvements towards the full luminosity data taking are also reviewed.

Highlights

The SuperKEKB accelerator and the Belle II detector are major upgrades of the KEKB accelerator and Belle detector, located at the KEK laboratory in Tsukuba, Japan [1]
The grid system of the Belle II experiment consists of 58 computing sites and 21 storage elements around the world, centrally managed with DIRAC
The Belle II Raw Data Management system is the main component of BelleRawDIRAC, another extension of DIRAC on top of BelleDIRAC, dedicated to registration and replication of raw data files

Summary

Introduction

The SuperKEKB accelerator and the Belle II detector are major upgrades of the KEKB accelerator and Belle detector, located at the KEK laboratory in Tsukuba, Japan [1]. Electrons and positrons collide with a high rate, with an expectation to accumulate a data sample of 50 ab−1, approximately 50 times more than its predecessor With this projected luminosity, Belle II is expected to produce tens of petabytes of real and simulated data per year. For this purpose, the distributed computing group of Belle II has designed a customized system to perform the registration and replication of raw data files efficiently

Raw Data Registration and Replication

Monitoring

Raw Data Processing during the Early Operation of Belle II

Monitoring System

Findings

Conclusions and Perspectives