Estimating Reliability of Workers for Cooperative Distributed Computing

Seda Davtyan, Alexander A Shvartsman, Kishori M Konwar

doi:10.1109/ispdc.2013.22

Abstract

Internet supercomputing is an approach to solving partitionable, computation-intensive problems by harnessing the power of a vast number of interconnected computers. For the problem of using network supercomputing to perform a large collection of independent tasks, prior work introduced a decentralized approach and provided randomized synchronous algorithms that perform all tasks correctly with high probability, while dealing with misbehaving or crash-prone processors. The main weakness of existing algorithms is that they assume either that the average probability of a non-crashed processor returning incorrect results is less than $1/2$, or that the probability of returning incorrect results is known to each processor. Here we present a randomized synchronous distributed algorithm that tightly estimates the probability of each processor returning correct results. Starting with the set $P$ of $n$ processors, let $F$ be the set of processors that crash. Our algorithm estimates the probability $p_i$ of returning a correct result for each processor $i \in P - F$, making the estimates available to all these processors. The estimation is based on the $(\varepsilon, \delta)$-approximation, where each estimate $\tilde{p}_i$ of $p_i$ obeys the bound $\Pr[p_i(1 - \varepsilon) \le \tilde{p}_i \le p_i(1 + \varepsilon)] > 1 - \delta$, for any constants $\delta > 0$ and $\varepsilon > 0$ chosen by the user. An important aspect of this algorithm is that each processor terminates without global coordination. We assess the efficiency of the algorithm in three adversarial models as follows. For the model where the number of non-crashed processors $|P - F|$ is linearly bounded, the time complexity $T(n)$ of the algorithm is $O(\log n)$, the work complexity $W(n)$ is $O(n \log n)$, and the message complexity $M(n)$ is $O(n \log^2 n)$. For the model where $|P - F|$ is bounded by a fractional polynomial, we have $T(n) = O(n^{1-a} \log n \log\log n)$, $W(n) = O(n \log n \log\log n)$, and $M(n) = O(n \log^2 n \log\log n)$. For the model where $|P - F|$ is bounded by a poly-logarithm, we have $T(n) = O(n)$, $W(n) = O(n \operatorname{polylog} n)$, and $M(n) = O(n \log^2 n \operatorname{polylog} n)$. All bounds are shown to hold with high probability.
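To make the $(\varepsilon, \delta)$ guarantee concrete, the following is a minimal single-machine sketch of one standard way to obtain such an estimate for a Bernoulli probability: the stopping rule of Dagum, Karp, Luby, and Ross. It is an illustration under stated assumptions, not the distributed algorithm of the paper. The names `stopping_rule_estimate` and `simulated_worker` are hypothetical, the worker is simulated with a fixed success probability, and the sketch assumes $0 < \varepsilon < 1$, $0 < \delta < 1$, and $p_i > 0$ (otherwise the loop would not terminate).

```python
import math
import random
from typing import Callable


def stopping_rule_estimate(sample: Callable[[], int], eps: float, delta: float) -> float:
    """Estimate the mean of a 0/1 random variable to within relative error eps,
    with failure probability at most delta, using the stopping rule of
    Dagum, Karp, Luby, and Ross.

    Assumption of this sketch: the true mean is positive, so the loop ends.
    """
    if not (0 < eps < 1 and 0 < delta < 1):
        raise ValueError("this sketch assumes 0 < eps < 1 and 0 < delta < 1")

    # Number of successes to wait for before stopping.
    upsilon = 4 * (math.e - 2) * math.log(2 / delta) / eps ** 2
    threshold = 1 + (1 + eps) * upsilon

    successes = 0
    trials = 0
    while successes < threshold:
        trials += 1
        successes += sample()  # 1 for a correct result, 0 otherwise

    # The estimate is the success threshold divided by the trials needed.
    return threshold / trials


def simulated_worker(p: float, rng: random.Random) -> Callable[[], int]:
    """Hypothetical worker that returns a correct result with probability p."""
    return lambda: 1 if rng.random() < p else 0


# Usage: estimate the reliability of one simulated worker.
rng = random.Random(2013)
true_p = 0.8
estimate = stopping_rule_estimate(simulated_worker(true_p, rng), eps=0.1, delta=0.05)
print(f"true p = {true_p}, estimate = {estimate:.3f}")
```

The number of samples drawn adapts to the unknown probability: a less reliable worker needs more trials before the success threshold is reached, which is why no prior bound on $p_i$ is required.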

