On the implementation of unreliable failure detectors in partially synchronous systems

M Larrea,A Fernandez,S Arevalo

doi:10.1109/tc.2004.33

Abstract

Unreliable failure detectors were proposed by Chandra and Toueg as mechanisms that provide information about process failures. Chandra and Toueg defined eight classes of failure detectors, depending on how accurate this information is, and presented an algorithm implementing a failure detector of one of these classes in a partially synchronous system. This algorithm is based on all-to-all communication and periodically exchanges a number of messages that is quadratic on the number of processes. We study the implementability of different classes of failure detectors in several models of partial synchrony. We first show that no failure detector with perpetual accuracy (namely, P, Q, S, and W) can be implemented in these models in systems with even a single failure. We also show that, in these models of partial synchrony, it is necessary a majority of correct processes to implement a failure detector of the class /spl theta/ proposed by Aguilera et al. Then, we present a family of distributed algorithms that implement the four classes of unreliable failure detectors with eventual accuracy (namely, /spl diams/P, /spl diams/Q, /spl diams/S, and /spl diams/W). Our algorithms are based on a logical ring arrangement of the processes, which defines the monitoring and failure information propagation pattern. The resulting algorithms periodically exchange at most a linear number of messages.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On the implementation of unreliable failure detectors in partially synchronous systems

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers

Lead the way for us

Journal: IEEE Transactions on Computers	Publication Date: Jul 1, 2004
Citations: 74

Similar Papers

Efficient Algorithms to Implement Unreliable Failure Detectors in Partially Synchronous Systems
Mikel Larrea ... Sergio Arevalo
-
Mikel Larrea, et. al.Mikel Larrea ... Sergio Arevalo
01 Jan 1998
01 Jan 1998

Eventually consistent failure detectors
M Larrea ... A Fernandez
-
M Larrea, et. al.M Larrea ... A Fernandez
09 Jan 2002
09 Jan 2002

Eventually consistent failure detectors
Mikel Larrea ... Sergio Arévalo
Journal of Parallel and Distributed Computing | VOL. 65
Mikel Larrea, et. al.Mikel Larrea ... Sergio Arévalo
22 Jan 2005
Journal of Parallel and Distributed Computing | VOL. 65

Eventually consistent failure detectors
Mikel Larrea ... Antonio Fernández
-
Mikel Larrea, et. al.Mikel Larrea ... Antonio Fernández
03 Jul 2001
03 Jul 2001

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the implementation of unreliable failure detectors in partially synchronous systems

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Computers