Abstract

To optimise the performance of distributed compute, smaller lightweight storage caches are needed which integrate with existing grid computing workflows. A good solution to provide lightweight storage caches is to use an XRootD-proxy cache. To support distributed lightweight XRootD proxy services across GridPP we have developed a centralised monitoring framework. With the v5 release of XRootD it is possible to build a monitoring framework which collects distributed caching metadata broadcast from multiple sites. To provide the best support for these distributed caches we have built a centralised monitoring service for XRootD storage instances within GridPP. This monitoring solution is built upon experiences presented by CMS in setting up a similar service as part of their AAA system. This new framework is designed to provide remote monitoring of the behaviour, performance, and reliability of distributed XRootD services across the UK. Effort has been made to simplify ease of deployment by remote site administrators. The result of this work is an interactive dashboard system which enables administrators to access real-time metrics on the performance of their lightweight storage systems. This monitoring framework is intended to supplement existing functionality and availability testing metrics by providing detailed information and logging from a site perspective.

Highlights

  • To optimise the amount of compute available to GridPP [1], [2] smaller sites are focussing on maximising their processing power contributions

  • Previous work at Edinburgh [4] has focussed on developing a monitoring system targeted towards XRootD-proxy caches used within the UK at Edinburgh and Birmingham WLCG [5] Tier2 facilities

  • Making use of the monitoring framework which has been deployed for GridPP, it’s possible to view live statistics of various sites throughout the UK in much the same way as AAA is able to view live XRootD activities for CMS

Read more

Summary

Introduction

To optimise the amount of compute available to GridPP [1], [2] smaller sites are focussing on maximising their processing power contributions Providing access to these distributed computing facilities requires these sites to run a lightweight storage cache to provide high overall job efficiency. Previous work at Edinburgh [4] has focussed on developing a monitoring system targeted towards XRootD-proxy caches used within the UK at Edinburgh and Birmingham WLCG [5] Tier facilities. This system was built by adopting the monitoring approaches used within XCache [6] as part of the SLATE project [7]. With the release of v5 of the XRootD project it is possible to integrate our monitoring more closely with XRootD improving the experience of sites looking to deploy such a service

Requirements for the GridPP XRootD monitoring framework
Integration with v5 XRootD
Ingesting monitoring data
Results
Summary
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call