Abstract

Fault detection is a fundamental building block for management, replication, load balancing, and other services in large-scale distributed systems. However, the fault detection services in such systems scale poorly with the number of members being monitored. This paper presents a new gossip-based protocol, the Adaptive Randomized Gossip-based Fault Detection service (ARGoFD), that scales well and provides timely detection. We describe the protocol in detail and analyze its main algorithms. Finally, we report comparison experiments. The results show that ARGoFD effectively reduces the number of redundant messages and increases the detection convergence rate.
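To make the idea of gossip-based fault detection concrete, the following is a minimal sketch of a generic heartbeat-gossip detector, not ARGoFD itself: the abstract does not specify the protocol's adaptive or randomized mechanisms, so none are modeled here. Each member increments its own heartbeat counter every round, pushes its table to one randomly chosen peer, and suspects any member whose counter has not advanced for longer than a timeout. All names (`Member`, `simulate`, `t_fail`, `crash_at`) and parameter values are assumptions chosen for illustration.

```python
import random


class Member:
    """One monitored process in a basic heartbeat-gossip detector (illustrative)."""

    def __init__(self, name, names, t_fail):
        self.name = name
        self.peers = [p for p in names if p != name]
        self.t_fail = t_fail  # assumed timeout, in gossip rounds
        # member name -> (highest heartbeat seen, local round when it last increased)
        self.table = {p: (0, 0) for p in names}

    def tick(self, now):
        """Advance this member's own heartbeat counter."""
        hb, _ = self.table[self.name]
        self.table[self.name] = (hb + 1, now)

    def merge(self, remote, now):
        """Adopt any strictly newer heartbeat found in a received table."""
        for name, (hb, _) in remote.items():
            if hb > self.table[name][0]:
                self.table[name] = (hb, now)

    def suspects(self, now):
        """Members whose heartbeat has not advanced within t_fail rounds."""
        return {
            n for n, (_, seen) in self.table.items()
            if n != self.name and now - seen > self.t_fail
        }


def simulate(n=8, rounds=60, crash_at=10, t_fail=20, seed=1):
    """Run the sketch; member m0 crashes at round `crash_at` (hypothetical scenario)."""
    rng = random.Random(seed)
    names = [f"m{i}" for i in range(n)]
    members = {x: Member(x, names, t_fail) for x in names}
    crashed = "m0"
    for now in range(1, rounds + 1):
        for m in members.values():
            if m.name == crashed and now >= crash_at:
                continue  # a crashed member neither ticks nor gossips
            m.tick(now)
            target = rng.choice(m.peers)
            if target == crashed and now >= crash_at:
                continue  # a message sent to the crashed member is lost
            members[target].merge(m.table, now)
    # Report the suspicion set of every surviving member at the final round.
    return {x: m.suspects(rounds) for x, m in members.items() if x != crashed}


if __name__ == "__main__":
    for name, suspected in sorted(simulate().items()):
        print(name, sorted(suspected))
```

In this sketch every member sends exactly one message per round, so the per-member load stays constant as the group grows, while information about a stalled heartbeat spreads epidemically. The fixed timeout and uniform peer selection are the simplest possible choices and are where an adaptive scheme would presumably differ.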
