Abstract

Failure detectors are one of the fundamental components for building a distributed system with high availability. To maintain the efficiency and scalability of failure detection in complicated large-scale distributed systems, accrual failure detectors that can adapt to multiple applications have been studied extensively. In this paper, a new accrual failure detector with low system overhead, LA-FD, is proposed specifically for current mobile network equipment on the Internet, whose processing power, memory space and power supply are all constrained. It relies neither on the probability distribution of message transmission time nor on the maintenance of a history message window. Using simple calculations, LA-FD provides an adaptive failure detection service with high accuracy to multiple upper-layer applications. The related experiments and results are also presented.

Highlights

  • Failure detectors are one of the fundamental components for building a distributed system with high availability [1]

  • The expanding system scale and increasingly complex distributed applications pose more and more challenges to the efficiency and scalability [6] of failure detectors (Sensors 2012, 12)

  • Because load affects scalability, it is not practical to supply a separate failure detector for each application. This creates another requirement for adaptive failure detectors: they must adapt to the different QoS requirements demanded by multiple applications

Summary

Introduction

Failure detectors are one of the fundamental components for building a distributed system with high availability [1]. With the development of various network applications, multiple applications often run simultaneously in large-scale systems such as grid, P2P and cloud computing, and they have different failure detection QoS requirements. Adaptive failure detectors must therefore adapt to the different QoS requirements demanded by multiple applications. This has become an important issue in the research of failure detection in large-scale distributed systems [6]. Mobile terminals such as cell phones and tablet PCs are being used more widely. Most of this equipment consists of embedded systems whose processing power, memory space and power supply are all constrained, but the previously proposed accrual detectors require a probability distribution model for message transmission delay. LA-FD is able to provide an adaptive failure detection service with high accuracy to multiple upper-layer applications.
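
To make the accrual idea concrete, the following is a minimal sketch of how one detector can serve several applications with different QoS requirements while keeping only constant state per monitored process. It is not the LA-FD algorithm, whose formula this summary does not give. The class `SimpleAccrualDetector`, the helper `is_suspected`, and the suspicion measure (elapsed time since the last heartbeat divided by the nominal heartbeat interval) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class SimpleAccrualDetector:
    """Illustrative accrual detector with O(1) state per monitored process.

    Assumption: the suspicion level is the time since the last heartbeat,
    measured in units of the nominal heartbeat interval. This is a stand-in
    for illustration only, not the LA-FD formula.
    """

    heartbeat_interval: float   # nominal sending period, in seconds
    last_arrival: float = 0.0   # arrival time of the most recent heartbeat

    def heartbeat(self, now: float) -> None:
        # Record the arrival; no history window and no delay model is kept.
        self.last_arrival = now

    def suspicion(self, now: float) -> float:
        # The level grows continuously while no heartbeat arrives.
        elapsed = max(0.0, now - self.last_arrival)
        return elapsed / self.heartbeat_interval


def is_suspected(detector: SimpleAccrualDetector, now: float, threshold: float) -> bool:
    # Each application interprets the shared suspicion level with its own threshold.
    return detector.suspicion(now) > threshold


if __name__ == "__main__":
    detector = SimpleAccrualDetector(heartbeat_interval=1.0)
    detector.heartbeat(now=10.0)
    # Hypothetical thresholds: an aggressive and a conservative application.
    for name, threshold in [("aggressive", 2.0), ("conservative", 5.0)]:
        print(name, is_suspected(detector, now=12.5, threshold=threshold))
```

The point of the design is that the monitoring side computes a single value per process, and the choice between fast detection and few false suspicions is left to each application's threshold.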

System Model
Basic Failure Detection Strategy
Basic Idea of the Algorithm
For process q
Experimental Results and Analysis
Analysis of Detection Accuracy
Comparison of System Overhead
Conclusions