Computing Fault-Containment Times of Self-Stabilizing Algorithms Using Lumped Markov Chains

Volker Turau

doi:10.3390/a11050058

Abstract

The analysis of self-stabilizing algorithms is often limited to the worst case stabilization time starting from an arbitrary state, i.e., a state resulting from a sequence of faults. Considering the fact that these algorithms are intended to provide fault tolerance in the long run, this is not the most relevant metric. A common situation is that a running system is an a legitimate state when hit by a single fault. This event has a much higher probability than multiple concurrent faults. Therefore, the worst case time to recover from a single fault is more relevant than the recovery time from a large number of faults. This paper presents techniques to derive upper bounds for the mean time to recover from a single fault for self-stabilizing algorithms based on Markov chains in combination with lumping. To illustrate the applicability of the techniques they are applied to a new self-stabilizing coloring algorithm.

Highlights

Fault tolerance aims at making distributed systems more reliable by enabling them to continue the provision of services in the presence of faults
In particular we demonstrate how lumping can be applied to reduce the complexity of the Markov chains
The analysis of self-stabilizing algorithms is often confined to the stabilization time starting from an arbitrary configuration

Summary

Introduction

Fault tolerance aims at making distributed systems more reliable by enabling them to continue the provision of services in the presence of faults. Self-stabilizing algorithms belong to the category of distributed algorithms that provide non-masking fault tolerance They guarantee that systems eventually recover from transient faults of any scale such as perturbations of the state in memory or communication message corruption [2]. The containment time of A denotes the worst-case number of rounds any execution of A starting at a 1-faulty configuration needs to reach a legitimate configuration. The reason is that a distributed system consists of independently operating computers where transient faults such as memory faults in different computers are independent events Considering this fact it comes as a surprise that most papers consider only arbitrary initial states (i.e., k-faulty configuration for any k) instead of focusing on 1-faulty configuration. We believe that the techniques can be applied to other algorithms

Related Work

System Model

Contamination Radius

Containment Time

Self-Stabilizing Algorithms and Markov Chains

Algorithm Acol

Fault Containment Time of Algorithm Acol

Message Corruption

Memory Corruption

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithms	Publication Date: May 3, 2018
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Computing Fault-Containment Times of Self-Stabilizing Algorithms Using Lumped Markov Chains

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms

Lead the way for us

Similar Papers

A row-based FPGA for single and multiple stuck-at fault detection
X.T Chen ... W.K Huang
-
X.T Chen, et. al.X.T Chen ... W.K Huang
13 Nov 1995
13 Nov 1995

Efficient SAT-based ATPG techniques for all multiple stuck-at faults
Masahiro Fujita ... Alan Mishchenko
-
Masahiro Fujita, et. al.Masahiro Fujita ... Alan Mishchenko
01 Oct 2014
01 Oct 2014

Computing the Fault-Containment Time of Self-Stabilizing Algorithms Using Markov Chains and Lumping
Volker Turau
-
Volker TurauVolker Turau
01 Jan 2017
01 Jan 2017

Test pattern generation for multiple stuck-at faults not covered by test patterns for single faults
Conrad J Moore ... Peikun Wang
-
Conrad J Moore, et. al.Conrad J Moore ... Peikun Wang
01 May 2017
01 May 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Computing Fault-Containment Times of Self-Stabilizing Algorithms Using Lumped Markov Chains

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms