Abstract

Failure detection is a key technology in tolerant system. Failure detectors without adaptive mechanism cannot meet the requirements of QOS (quality of service) of applications because of the variations of the network in actual distributed system. Adaptive failure detectors should dynamically adjust the detecting quality according to the variations of the real-time state of the network. Assuming that the delay and loss of the messages is a random probability, a failure detection model based on the predicted message delay is proposed in this paper. A PAC-AFD adaptive failure detection algorithm is realized based on the above model which is on the basis of the prediction from historical message delay and contains checking idea. Experimental results show that the algorithm can relieve the effect of the delay and loss of the message on the failure detection while ensuring the accuracy and completeness of detection.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call