Mitigating Alert Fatigue in Cloud Monitoring Systems: A Machine Learning Perspective

Fotios Voutsas, John Violos, Aris Leivadeas

doi:10.1016/j.comnet.2024.110543

Abstract

Next-generation networks will rely heavily on monitoring and telemetry tools, which are essential for maintaining optimal performance, ensuring security, managing costs, and performing fault detection and resolution. An integral part of the overall monitoring strategy is alerting, which provides administrators with the information they need to proactively or reactively manage and optimize network services. However, when monitoring systems generate an excessive number of alerts, many of which may not be actionable or may not represent critical issues, alert fatigue occurs. Alert fatigue refers to a situation where the volume and speed of the continuous influx of alerts become so overwhelming that network administrators become desensitized and stop responding to them. To this end, and inspired by recent trends in network automation, where human intervention tends to be minimized, we introduce an alert fatigue mitigation mechanism for monitoring, focusing on cloud computing infrastructures. In particular, a composite machine learning methodology is proposed to select which alerts will be hidden and which will be presented to the administrators. Additionally, to personalize the results, the proposed approach considers the users' level of experience along with the alert features to further improve the accuracy of the alert filtering mechanism. The research was conducted in a realistic environment of a leading monitoring enterprise, Netdata, which provided two datasets for testing our approach. Furthermore, the results of the filtering mechanism were evaluated by expert engineers of the company, who verified the output of the proposed framework. Specifically, the outcomes confirm that our proposed methodology mitigates the alert fatigue problem with an accuracy that surpasses 90% in most cases.
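To make the filtering idea concrete, the following is a minimal sketch of a personalized show/hide decision that combines alert features with a user's experience level. It is not the authors' composite methodology, which the abstract does not detail: a plain k-nearest-neighbour vote is swapped in as a stand-in classifier, and the names `Alert`, `severity`, `repeat_rate`, `should_show`, and the `HISTORY` records are all hypothetical, made up purely for illustration.

```python
from dataclasses import dataclass
from math import dist


@dataclass(frozen=True)
class Alert:
    # Both features are hypothetical; the abstract does not list the real ones.
    severity: float     # 0.0 (informational) .. 1.0 (critical)
    repeat_rate: float  # 0.0 (rare) .. 1.0 (fires constantly)


def features(alert, experience):
    """Join alert features with the user's experience (0.0 novice .. 1.0 expert)."""
    return (alert.severity, alert.repeat_rate, experience)


def should_show(alert, experience, history, k=3):
    """Majority vote among the k past decisions closest to this alert/user pair."""
    query = features(alert, experience)
    nearest = sorted(history, key=lambda item: dist(item[0], query))[:k]
    votes_to_show = sum(1 for _, shown in nearest if shown)
    return votes_to_show * 2 > len(nearest)


# Toy labelled history: (feature vector, was the alert worth showing?).
# These values are invented for the example, not taken from the Netdata datasets.
HISTORY = [
    ((0.9, 0.1, 0.2), True),
    ((0.9, 0.2, 0.9), True),
    ((0.8, 0.3, 0.5), True),
    ((0.2, 0.9, 0.9), False),
    ((0.1, 0.8, 0.8), False),
    ((0.3, 0.9, 0.5), False),
    ((0.3, 0.5, 0.1), True),
    ((0.3, 0.5, 0.9), False),
]

if __name__ == "__main__":
    critical = Alert(severity=0.95, repeat_rate=0.1)
    noisy = Alert(severity=0.15, repeat_rate=0.9)
    print(should_show(critical, 0.5, HISTORY))
    print(should_show(noisy, 0.9, HISTORY))
```

In this toy setup a rare, high-severity alert is shown, while a low-severity alert that fires constantly is hidden from an experienced user. Including the experience level in the feature vector is what lets the same alert be treated differently for different users.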


Journal: Computer Networks
Publication Date: May 25, 2024
Citations: 1
License type: cc-by-nc


