Abstract

In the latest years, CNAF (the national center of the Italian Institute for Nuclear Physics INFN dedicated to Research and Development on Information and Communication Technologies) has been working on the Long Term Data Preservation (LTDP) project for the CDF experiment, active at Fermilab from 1990 to 2011. The main aims of the project are to protect the most relevant part of the CDF RUN-2 data collected between 2001 and 2011 and already stored on tape at CNAF (4 PB), as well as to ensure the availability and the access to the analysis facility to those data over time. Lately, the CDF database, hosting information about CDF datasets such as their structure, file locations and metadata, has been imported from Fermilab to CNAF. Also, the Sequential Access via Metadata (SAM) station data handling tool for CDF data management, that allows to manage data transfers and to retrieve information from the CDF database, has been properly installed and configured at CNAF. This was a fundamental step in the perspective of a complete decommissioning of CDF services on Fermilab side. An access system has been designed and tested to submit CDF analysis jobs, using CDF software distributed via CERN Virtual Machine File System (CVMFS) and requesting delivery of CDF files stored on CNAF tapes, as well as data present only on Fermilab storage archive. Moreover, the availability and the correctness of all CDF data stored on CNAF tapes has been verified. This paper describes all these recent evolutions in detail, presenting the future plans for the LTDP project at CNAF.

Highlights

  • From the very beginning of its activities as a computing center, CNAF took part in the CDF collaboration, providing computing and storage resources for the CDF experiment [1]

  • Even after the end of CDF data taking activities in 2011, Fermi National Accelerator Laboratory (FNAL) continued to collaborate with CNAF in order to build and maintain a CDF Long Term Data Preservation (LTDP) facility at CNAF, with the aim to preserve the data produced by the detector for future analyses and computing activities

  • The CDF job will setup CDF software from OSG CERN Virtual Machine File System (CVMFS), which is properly mounted on the machine, and it will use ifdh to copy files locally through cp command from CNAF tapes via GPFS. ifdh interacts with SamWEB server, interfacing with the Sequential Access via Metadata (SAM) station, which in turn sends a query to the Oracle database and gets the proper file locations

Read more

Summary

Introduction

From the very beginning of its activities as a computing center, CNAF took part in the CDF collaboration, providing computing and storage resources for the CDF experiment [1]. Even after the end of CDF data taking activities in 2011, Fermi National Accelerator Laboratory (FNAL) continued to collaborate with CNAF in order to build and maintain a CDF Long Term Data Preservation (LTDP) facility at CNAF, with the aim to preserve the data produced by the detector for future analyses and computing activities This kind of activity is defined as “framework preservation”, implying the possibility for previously authorized scientific communities to access and use data through dedicated software services [2]. CNAF hosts all the raw data and part of analysis-level Monte Carlo n-tuples belonging to the so-called CDF RUN-2 data taking phase, that lasted from 2001 to 2011 These data were copied from FNAL storage facilities to CNAF data center before 2017, with the primary purpose of keeping an extensive backup copy of the experiment’s most relevant datasets.

The CDF data handling system at FNAL
SAM station
SamWEB
Moving CDF data handling system to CNAF
Relevant results and future steps
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call