Abstract

Power-of- $d$ -choices is a popular load balancing algorithm for many-server systems such as large-scale data centers. For each incoming job, the algorithm probes $d$ servers, chosen uniformly at random from a total of $N$ servers ( $N$ is the number of servers in the system), and routes the job to the least loaded one. It is well known that power-of- $d$ -choices reduces queueing delays by orders of magnitude compared to the policy that routes each incoming job to a randomly selected server. The question to be addressed in this paper is how large $d$ needs to be so that power-of- $d$ -choices achieves asymptotic zero delay like the join-the-shortest-queue (JSQ) algorithm, which is a special case of power-of- $d$ -choices with $d=N.$ We are interested in the heavy-traffic regime where the load of the system, denoted by $\lambda, $ approaches to one as $N$ increases, and assume $\lambda =1-\gamma N^{-\alpha }$ for $0 This paper establishes that when $d=\Omega \left(\frac{\log N}{1-\lambda }\right)$ , finite buffer size $b$ and $0\leq \alpha the probability that an incoming job is routed to a busy server is asymptotically zero, i.e., a job experiences zero queueing delay with probability one asymptotically; and when $d=O\left(\frac{1}{1-\lambda }\right)$ and infinite buffer size $b=\infty$ , the probability that a job is routed to a busy server is lower bounded by a positive constant independent of $N.$ Therefore, our results show that $d=\Omega \left(\frac{\log N}{1-\lambda }\right)$ is sufficient and almost necessary for achieving zero delay with the power-of- $d$ -choices policy.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call