Ensemble of Bayesian Predictors and Decision Trees for Proactive Failure Management in Cloud Computing Systems

Qiang Guan, Song Fu, Ziming Zhang

doi:10.4304/jcm.7.1.52-61

Abstract

In modern cloud computing systems, hundreds or even thousands of cloud servers are interconnected by multi-layer networks. In such large-scale and complex systems, failures are common. Proactive failure management is a crucial technology for characterizing system behaviors and forecasting failure dynamics in the cloud. To make failure predictions, we need to monitor system execution and collect health-related runtime performance data. However, in newly deployed or managed cloud systems, these data are usually unlabeled, so approaches based on supervised learning are not suitable. In this paper, we present an unsupervised failure detection method using an ensemble of Bayesian models. It characterizes normal execution states of the system and detects anomalous behaviors. Once system administrators have verified the anomalies, labeled data become available. We then apply supervised learning based on decision tree classifiers to predict future failure occurrences in the cloud. Experimental results from an institute-wide cloud computing system show that our methods can achieve a high true positive rate and a low false positive rate for proactive failure management.
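The two-stage workflow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify the Bayesian models, the ensemble rule, or the tree algorithm, so everything below is an assumption made for illustration. Each ensemble member is taken to be a naive (independent-metric) Gaussian model fitted on a bootstrap sample, members vote by majority, the tree uses Gini-impurity splits, and the two metrics (CPU and memory utilization) and all data are synthetic. Administrator verification is simulated with the known synthetic ground truth.

```python
import math
import random
import statistics

# --- Stage 1 (assumed form): unsupervised detection with an ensemble ---

def fit_gaussian_member(rows):
    """Fit one independent Gaussian per metric: returns [(mean, std), ...]."""
    cols = list(zip(*rows))
    return [(statistics.fmean(c), statistics.pstdev(c) or 1e-6) for c in cols]

def log_likelihood(member, row):
    """Log-density of a row under one member, treating metrics as independent."""
    total = 0.0
    for (mu, sd), x in zip(member, row):
        total += -0.5 * math.log(2 * math.pi * sd * sd) - (x - mu) ** 2 / (2 * sd * sd)
    return total

def fit_ensemble(rows, n_members=5, quantile=0.05, seed=0):
    """Each member sees a bootstrap sample and keeps its own low-likelihood cutoff."""
    rng = random.Random(seed)
    ensemble = []
    for _ in range(n_members):
        sample = [rng.choice(rows) for _ in rows]
        member = fit_gaussian_member(sample)
        scores = sorted(log_likelihood(member, r) for r in sample)
        ensemble.append((member, scores[int(quantile * len(scores))]))
    return ensemble

def is_anomaly(ensemble, row):
    """Majority vote: anomalous if most members score the row below their cutoff."""
    votes = sum(log_likelihood(m, row) < cutoff for m, cutoff in ensemble)
    return votes > len(ensemble) / 2

# --- Stage 2 (assumed form): supervised prediction with a small decision tree ---

def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    p = sum(labels) / len(labels)
    return 2 * p * (1 - p)

def build_tree(rows, labels, depth=0, max_depth=3):
    """Grow a binary tree with Gini splits; a leaf is a 0/1 label."""
    majority = int(sum(labels) * 2 >= len(labels))
    if depth == max_depth or len(set(labels)) == 1:
        return majority
    best = None
    for f in range(len(rows[0])):
        for t in sorted({r[f] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            if not left or not right:
                continue
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:
        return majority
    _, f, t = best
    lo = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    hi = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return (f, t,
            build_tree([r for r, _ in lo], [y for _, y in lo], depth + 1, max_depth),
            build_tree([r for r, _ in hi], [y for _, y in hi], depth + 1, max_depth))

def predict(tree, row):
    """Walk from the root to a leaf and return its 0/1 label."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

# --- Synthetic demo: (cpu %, memory %) samples, mostly normal, a few faulty ---
rng = random.Random(1)
normal = [(rng.gauss(30, 5), rng.gauss(50, 5)) for _ in range(300)]
faulty = [(rng.gauss(95, 2), rng.gauss(90, 2)) for _ in range(4)]
data = normal + faulty
truth = [0] * len(normal) + [1] * len(faulty)  # hidden from stage 1

ensemble = fit_ensemble(data)
flags = [is_anomaly(ensemble, r) for r in data]
# Simulated administrator verification: keep only flags that are real failures.
labels = [int(f and t) for f, t in zip(flags, truth)]
tree = build_tree(data, labels)
```

In this sketch the detector deliberately over-flags (roughly the lowest 5% of likelihoods per member), and the verification step is what removes the false alarms before the tree is trained. With real monitoring data, the cutoff quantile and tree depth would need to be tuned.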

Journal: Journal of Communications
Publication Date: Jan 1, 2012
Citations: 99

Similar Papers

Ensemble of Bayesian Predictors for Autonomic Failure Management in Cloud Computing
Qiang Guan ... Ziming Zhang
01 Jul 2011

Differential Diagnosis of Erythemato-Squamous Diseases Using Ensemble of Decision Trees
Mohamed El Bachir Menai ... Nuha Altayash
01 Jan 2014

Proactive Failure Management by Integrated Unsupervised and Semi-Supervised Learning for Dependable Cloud Systems
Qiang Guan ... Song Fu
01 Aug 2011

FPGA Implementation of Decision Trees and Tree Ensembles for Character Recognition in Vivado HLS
Rafał Kułaga ... Marek Gorgoń
Image Processing & Communications | VOL. 19
01 Sep 2014
