Self-healing in autonomic distributed systems based on delayed communication-induced checkpointing

Alberto Calixto Simón,Saul E Pomares Hernandez,Hatem Hadj Kacem,Jose Roberto Perez Cruz,Riadh Ben Halima

doi:10.1504/ijaacs.2016.079621

Abstract

An autonomic distributed system is composed of geographically distributed autonomic components. One open challenge in autonomic computing is the efficient monitoring at runtime oriented towards the collection of information, from which the system itself will detect, diagnose, and repair problems that result from failures in software and/or hardware components. For this purpose, communication-induced checkpointing CIC can be a useful tool. CIC aims to form global consistent snapshots from which the system can recover. To achieve this, CIC solutions monitor exchanged information among the processes to identify dangerous checkpointing patterns. When a dangerous pattern is identified, it is broken by locally triggering a forced checkpoint. Nevertheless, not all triggered forced checkpoints are necessary. In this paper, we present a delayed CIC approach that reduces forced checkpoints by using triggering rules called safe checkpoint conditions. Finally, we present simulation results that show that our proposal is more efficient than other current solutions.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Self-healing in autonomic distributed systems based on delayed communication-induced checkpointing

Abstract

Talk to us

Similar Papers

More From: International Journal of Autonomous and Adaptive Communications Systems

Lead the way for us

Journal: International Journal of Autonomous and Adaptive Communications Systems	Publication Date: Jan 1, 2016
Citations: 4

Similar Papers

A Delayed Checkpoint Approach for Communication-Induced Checkpointing in Autonomic Computing
Saul E Pomares Hernandez ... Jose Roberto Perez Cruz
-
Saul E Pomares Hernandez, et. al.Saul E Pomares Hernandez ... Jose Roberto Perez Cruz
01 Jun 2013
01 Jun 2013

Autonomic Web Services Based on Different Adaptive Quasi-Asynchronous Checkpointing Techniques
Saul Pomares-Hernandez ... Mariano Vargas-Santiago
Applied Sciences | VOL. 10
Saul Pomares-Hernandez, et. al.Saul Pomares-Hernandez ... Mariano Vargas-Santiago
05 Apr 2020
Applied Sciences | VOL. 10

Autonomic Web Services Enhanced by Asynchronous Checkpointing
Khalil Drira ... Mariano Vargas-Santiago
IEEE Access | VOL. 6
Khalil Drira, et. al.Khalil Drira ... Mariano Vargas-Santiago
01 Jan 2018
IEEE Access | VOL. 6

ECONOMIC AND PERFORMANCE EVALUATION OF STOCHASTIC MODEL ON A BASE TRANSCEIVER SYSTEM CONSIDERING VARIOUS OPERATIONAL MODES AND CATASTROPHIC FAILURES
Kumar
Journal of Mathematics and Statistics | VOL. 9
Kumar Kumar
01 Mar 2013
Journal of Mathematics and Statistics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-healing in autonomic distributed systems based on delayed communication-induced checkpointing

Abstract

Talk to us

Similar Papers

More From: International Journal of Autonomous and Adaptive Communications Systems