Erratic server behavior detection using machine learning on streams of monitoring data

Martin Adam, L Magnoni, Martin Pilát, D Adamová

doi:10.1051/epjconf/202024507002

Abstract

With the explosion in the number of distributed applications, a new dynamic server environment has emerged that groups servers into clusters whose utilization depends on the current demand for the application. To provide reliable and smooth services, it is crucial to detect and fix possible erratic behavior of individual servers in these clusters. Using standard techniques for this purpose requires manual work and delivers sub-optimal results. Using only application-agnostic monitoring metrics, our machine-learning-based method analyzes the recent performance of the inspected server as well as the state of the rest of the cluster, thus checking not only the behavior of the single server but also the load on the whole distributed application. We have implemented our method in a Spark job running in the CERN MONIT infrastructure. In this contribution we present the results of testing multiple machine learning algorithms and pre-processing techniques to identify the servers' erratic behavior. We also discuss the challenges of deploying our new method into production.

Highlights

In the last few decades the amount of digitally saved data has been growing exponentially [?].
In 2014 the world’s technological capacity to store information reached almost 5 zettabytes [1]. Handling this amount of incoming data requires innovative techniques that increasingly rely on horizontal scaling, an approach that uses many computers instead of a single more powerful one. Such novel approaches create additional concerns for system administrators when it comes to noticing errors that threaten the efficiency and availability of the application.
We present a process of acquiring and processing a stream of raw monitoring data in the MONIT [2] infrastructure.

Summary

Introduction

In the last few decades the amount of digitally saved data has been growing exponentially [?]. In 2014 the world’s technological capacity to store information reached almost 5 zettabytes [1]. Handling this amount of incoming data requires innovative techniques that increasingly rely on horizontal scaling, an approach that uses many computers instead of a single more powerful one. Traditional monitoring methods require a lot of manual labor when applied to this problem, and developers of open-source monitoring systems have so far been reluctant to include any advanced tools. In this project we set out to explore the possibility of using machine learning to spot erratic servers within a cluster running a distributed application. In an attempt to simplify administrators' work, many applications offer a set of internal metrics describing their performance. Incorporating these metrics into existing monitoring systems might be too time-consuming, considering that the lack of skilled administrators often leads to understaffed teams. We discuss the efficiency of such an approach, benchmark the core model and present plans for future development.
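The idea of spotting an erratic server by comparing it with the rest of its cluster can be illustrated with a minimal sketch. This is not the method used in the paper, which evaluates several machine learning algorithms in a Spark job. It is a simple robust z-score (median and median absolute deviation) over one application-agnostic metric. The function names, the sample values, and the threshold of 3.5 are all assumptions made for illustration.

```python
from statistics import median


def robust_scores(latest):
    """Score each server by its distance from the cluster median.

    `latest` maps a (hypothetical) server name to one metric value,
    e.g. CPU load averaged over the same recent time window.
    """
    values = list(latest.values())
    med = median(values)
    # Median absolute deviation: a spread estimate that a single
    # misbehaving server cannot inflate much.
    mad = median(abs(v - med) for v in values)
    # 1.4826 makes MAD comparable to a standard deviation for normal data;
    # fall back to a tiny scale if all servers report identical values.
    scale = 1.4826 * mad if mad > 0 else 1e-9
    return {server: abs(v - med) / scale for server, v in latest.items()}


def flag_erratic(latest, threshold=3.5):
    """Return the sorted names of servers whose score exceeds `threshold`."""
    scores = robust_scores(latest)
    return sorted(server for server, z in scores.items() if z > threshold)


if __name__ == "__main__":
    # Illustrative values only, not measurements from the paper.
    cluster = {"node1": 0.52, "node2": 0.49, "node3": 0.51,
               "node4": 0.50, "node5": 0.97}
    print(flag_erratic(cluster))
```

Because the score is relative to the cluster median, a uniform rise in load across all servers does not flag any single one. This mirrors the motivation for looking at the state of the whole cluster rather than applying a fixed per-server threshold.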

Monitoring Systems Overview

Data Gathering

Creating Anomalies

Analysing the Data

Unsupervised learning

Conclusions and Future Work

Journal: EPJ Web of Conferences
Publication Date: Jan 1, 2020
License type: CC BY 4.0

