AutoDiagn: An Automated Real-Time Diagnosis Framework for Big Data Systems

Umit Demirbaga,Khaled Alwasel,Rajiv Ranjan,Ayman Noor,Zhenyu Wen,Karan Mitra,Albert Y Zomaya,Saurabh Garg

doi:10.1109/tc.2021.3070639

Abstract

Big data processing systems, such as Hadoop and Spark, usually work in large-scale, highly-concurrent, and multi-tenant environments that can easily cause hardware and software malfunctions or failures, thereby leading to performance degradation. Several systems and methods exist to detect big data processing systems’ performance degradation, perform root-cause analysis, and even overcome the issues causing such degradation. However, these solutions focus on specific problems such as stragglers and inefficient resource utilization. There is a lack of a generic and extensible framework to support the real-time diagnosis of big data systems. In this article, we propose, develop and validate AutoDiagn. This generic and flexible framework provides holistic monitoring of a big data system while detecting performance degradation and enabling root-cause analysis. We present an implementation and evaluation of AutoDiagn that interacts with a Hadoop cluster deployed on a public cloud and tested with real-world benchmark applications. Experimental results show that AutoDiagn can offer a high accuracy root-cause analysis framework, at the same time as offering a small resource footprint, high throughput, and low latency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AutoDiagn: An Automated Real-Time Diagnosis Framework for Big Data Systems

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers

Lead the way for us

Journal: IEEE Transactions on Computers	Publication Date: Apr 6, 2021
Citations: 11

Similar Papers

A Complex Task Scheduling Scheme for Big Data Platforms Based on Boolean Satisfiability Problem
Huang Hong ... Ayoade Gbadebo
-
Huang Hong, et. al.Huang Hong ... Ayoade Gbadebo
01 Jul 2018
01 Jul 2018

Editorial for Special issue of FGCS special issue on “Benchmarking big data systems”
Sherif Sakr ... Athanasios V Vasilakos
Future Generation Computer Systems | VOL. 96
Sherif Sakr, et. al.Sherif Sakr ... Athanasios V Vasilakos
04 Feb 2019
Future Generation Computer Systems | VOL. 96

A Speculative Execution Framework for Big Data Processing Systems
Samar A Said ... Sameh A Salem
-
Samar A Said, et. al.Samar A Said ... Sameh A Salem
14 Jul 2021
14 Jul 2021

IOTSim: A simulator for analysing IoT applications
Xuezhi Zeng ... Saurabh Kumar Garg
Journal of Systems Architecture | VOL. 72
Xuezhi Zeng, et. al.Xuezhi Zeng ... Saurabh Kumar Garg
05 Jul 2016
Journal of Systems Architecture | VOL. 72

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AutoDiagn: An Automated Real-Time Diagnosis Framework for Big Data Systems

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers