Abstract

CERN's accelerator complex generates a very large amount of data. A large volumen of heterogeneous data is constantly generated from control equipment and monitoring agents. These data must be stored and analysed. Over the decades, CERN's researching and engineering teams have applied different approaches, techniques and technologies for this purpose. This situation has minimised the necessary collaboration and, more relevantly, the cross data analytics over different domains. These two factors are essential to unlock hidden insights and correlations between the underlying processes, which enable better and more efficient daily-based accelerator operations and more informed decisions. The proposed Big Data Analytics as a Service Infrastructure aims to: (1) integrate the existing developments; (2) centralise and standardise the complex data analytics needs for CERN's research and engineering community; (3) deliver real-time, batch data analytics and information discovery capabilities; and (4) provide transparent access and Extract, Transform and Load (ETL), mechanisms to the various and mission-critical existing data repositories. This paper presents the desired objectives and properties resulting from the analysis of CERN's data analytics requirements; the main challenges: technological, collaborative and educational and; potential solutions.

Highlights

  • CERN’s particle accelerator infrastructure is complex and heterogeneous

  • The second involves an implementation of a Big Data Analytics infrastructure as a Service (DAaaS), which aims to: (1) integrate the existing developments; (2) centralise and standardise the complex data analytics needs for the CERN’s research and engineering community; (3) deliver real time, batch data analytics and information discovery capabilities; and (4) Provide transparent access and ETL, mechanisms to the different and mission-critical existing data repositories

  • CERN has dedicated significant efforts to developing those systems, which has led to important data investments

Read more

Summary

Introduction

CERN’s particle accelerator infrastructure is complex and heterogeneous. Several missioncritical subsystems, which represent cutting-edge technology in several engineering fields, are involved. The second involves an implementation of a Big Data Analytics infrastructure as a Service (DAaaS), which aims to: (1) integrate the existing developments; (2) centralise and standardise the complex data analytics needs for the CERN’s research and engineering community; (3) deliver real time, batch data analytics and information discovery capabilities; and (4) Provide transparent access and ETL, mechanisms to the different and mission-critical existing data repositories.

Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.