Abstract

The projected storage and compute needs for the HL-LHC will be up to a factor of 10 above what can be achieved by the evolution of current technology within a flat budget. The WLCG community is studying possible technical solutions to evolve the current computing model in order to cope with these requirements; one of the main focuses is resource optimization, with the ultimate aim of improving performance and efficiency, as well as simplifying and reducing operation costs. As of today, storage consolidation based on a Data Lake model is considered a good candidate for addressing the HL-LHC data access challenges. The Data Lake model under evaluation can be seen as a logical system that hosts a distributed working set of analysis data. Compute power can be “close” to the lake, but also remote and thus completely external. In this context we expect data caching to play a central role as a technical solution to reduce the impact of latency and to reduce network load. A geographically distributed caching layer will serve the many satellite computing centers that may appear and disappear dynamically. In this talk we propose a system of caches, distributed at the national level, describing both the deployment and the results of the studies made to measure the impact on CPU efficiency. In this contribution, we also present early results on a novel caching strategy beyond the standard XRootD approach, whose results will serve as a baseline for an AI-based smart caching system.
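As a rough illustration of why a caching layer close to the compute reduces wide-area network load, the following is a minimal trace-replay sketch in Python. The file names, sizes, and cache capacity are hypothetical, and a plain LRU policy stands in for the standard XRootD eviction behaviour; it is not the system described in the paper.

```python
from collections import OrderedDict

class LRUCache:
    """Byte-capacity LRU cache: evicts least-recently-used files when full."""
    def __init__(self, capacity_bytes):
        self.capacity = capacity_bytes
        self.used = 0
        self.files = OrderedDict()  # filename -> size in bytes

    def access(self, name, size):
        """Return True on a hit; on a miss, insert the file, evicting as needed."""
        if name in self.files:
            self.files.move_to_end(name)   # refresh recency
            return True
        while self.used + size > self.capacity and self.files:
            _, evicted_size = self.files.popitem(last=False)
            self.used -= evicted_size
        self.files[name] = size
        self.used += size
        return False

# Hypothetical access trace (file, size): analysis jobs re-read hot files.
trace = [("f1", 2e9), ("f2", 3e9), ("f1", 2e9),
         ("f3", 4e9), ("f1", 2e9), ("f2", 3e9)]
cache = LRUCache(capacity_bytes=8e9)
wan_bytes = sum(size for name, size in trace if not cache.access(name, size))
total = sum(size for _, size in trace)
print(f"bytes fetched over WAN: {wan_bytes/1e9:.0f} GB of {total/1e9:.0f} GB requested")
```

Every hit in the replay is a read served locally instead of from a remote storage element, which is the traffic reduction the caching layer is meant to deliver.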

Highlights

  • With the upcoming High Luminosity LHC (HL-LHC) [1] program at CERN, all HEP experiments will face a new challenge: the exabyte era of computing [2]

  • The projected storage and compute needs for the HL-LHC will be up to a factor of 10 above what can be achieved by the evolution of current technology within a flat budget

  • The WLCG community is studying possible technical solutions to evolve the current computing model in order to cope with these requirements; one of the main focuses is resource optimization, with the ultimate aim of improving performance and efficiency, as well as simplifying and reducing operation costs

Summary

Introduction

With the upcoming High Luminosity LHC (HL-LHC) [1] program at CERN, all HEP experiments will face a new challenge: the exabyte era of computing [2]. In order to cope with this, a series of R&D programs have been established with the purpose of finding viable solutions for the optimization of the computing models. In this context, the activity presented in this work focuses on storage, looking for solutions that minimize hardware usage and increase performance, e.g. improving CPU efficiency by reducing I/O latencies, and that introduce handles to simplify operations, which represent an important cost to the collaboration. In the next section we report on the deployment done to integrate an INFN federation of distributed caches within the “Any Data, Anytime, Anywhere (AAA)” federation [7] of CMS. This includes a summary of the studies made to measure the effect of data caches on CPU job efficiency. We conclude with the plan for the extension toward an ML-based mechanism for cache management.
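CPU efficiency here is the usual WLCG-style ratio of CPU time to wall-clock time: a job that stalls waiting on remote reads accumulates wall time without accumulating CPU time. The back-of-the-envelope model below (all throughput figures hypothetical) shows how serving input from a nearby cache rather than over the WAN lifts the ratio; it illustrates the metric, not the paper's measured results.

```python
def cpu_efficiency(cpu_time_s, io_wait_s):
    """CPU efficiency = CPU time / wall time, with wall = CPU + I/O stall."""
    return cpu_time_s / (cpu_time_s + io_wait_s)

# Hypothetical analysis job: 3 h of CPU work reading 20 GB of input.
cpu_s = 3 * 3600
remote_io_s = 20e9 / 25e6   # ~25 MB/s effective over the WAN -> 800 s stalled
cached_io_s = 20e9 / 250e6  # ~250 MB/s from a nearby cache  ->  80 s stalled

print(f"remote read: {cpu_efficiency(cpu_s, remote_io_s):.1%}")
print(f"cached read: {cpu_efficiency(cpu_s, cached_io_s):.1%}")
```

The gap widens for more I/O-bound jobs, which is why the studies focus on the cache's effect on CPU job efficiency.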

The INFN distributed cache system
Studies on cache effect on CMS analysis jobs
Disk cache optimization: the idea
Strategy
The weight function
Results
Summary and future directions
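The paper's actual weight function is not reproduced on this page. Purely as an illustration of the idea behind weight-driven disk cache optimization named in the outline above, a generic score combining access frequency, recency, and file size (all parameter choices hypothetical) might look like the following; files with the lowest weight are evicted first.

```python
import math, time

def weight(last_access_ts, n_accesses, size_bytes, now=None, tau=7 * 86400):
    """Hypothetical eviction weight: recently and frequently read files score
    high, large cold files score low. tau sets the recency decay (~1 week)."""
    now = time.time() if now is None else now
    recency = math.exp(-(now - last_access_ts) / tau)
    return n_accesses * recency / (size_bytes / 1e9)  # normalised per GB

# Toy catalogue: filename -> (last access, access count, size in bytes).
files = {"hot.root":  (time.time() - 3600,        12, 2e9),
         "cold.root": (time.time() - 30 * 86400,   2, 8e9)}

# Candidate eviction order: ascending weight, coldest files first.
for name, meta in sorted(files.items(), key=lambda kv: weight(*kv[1])):
    print(name, f"weight={weight(*meta):.3f}")
```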