Mean Waiting Time in Large-Scale and Critically Loaded Power of d Load Balancing Systems

Tim Hellemans, Benny Van Houdt

doi:10.1145/3460086

Abstract

Mean field models are a popular tool used to analyse load balancing policies. In some exceptional cases the waiting time distribution of the mean field limit has an explicit form. In other cases it can be computed as the solution of a set of differential equations. In this paper we study the limit of the mean waiting time $E[W_\lambda]$ as the arrival rate $\lambda$ approaches 1 (i.e. as the system gets close to instability) for a number of load balancing policies in a large-scale system of homogeneous servers which finish work at a constant rate equal to one, with exponential job sizes with mean 1. As $E[W_\lambda]$ diverges to infinity, we scale with $-\log(1-\lambda)$ and present a method to compute the limit $\lim_{\lambda \to 1^-} -E[W_\lambda]/\log(1-\lambda)$. We show that this limit has a surprisingly simple form for the load balancing algorithms considered. More specifically, we present a general result that holds for any policy for which the associated differential equation satisfies a list of assumptions. For the well-known LL(d) policy, which assigns an incoming job to a server with the least work left among $d$ randomly selected servers, these assumptions are trivially verified. For this policy we prove the limit is given by $1/(d-1)$. We further show that the LL(d,K) policy, which assigns batches of $K$ jobs to the $K$ least loaded servers among $d$ randomly selected servers, satisfies the assumptions and the limit is equal to $K/(d-K)$. For a policy which applies LL($d_i$) with probability $p_i$, we show that the limit is given by $1/(\sum_i p_i d_i - 1)$. We further indicate that our main result can also be used for load balancers with redundancy or memory. In addition, we propose an alternate scaling $-\log(p_\lambda)$ instead of $-\log(1-\lambda)$, where $p_\lambda$ is adapted to the policy at hand, such that $\lim_{\lambda \to 1^-} -E[W_\lambda]/\log(1-\lambda) = \lim_{\lambda \to 1^-} -E[W_\lambda]/\log(p_\lambda)$, where the limit $\lim_{\lambda \to 0^+} -E[W_\lambda]/\log(p_\lambda)$ is well defined and non-zero (contrary to $\lim_{\lambda \to 0^+} -E[W_\lambda]/\log(1-\lambda)$). This allows us to obtain relatively flat curves for $-E[W_\lambda]/\log(p_\lambda)$ for $\lambda \in [0,1]$, which indicates that the low and high load limits can be used as an approximation when $\lambda$ is close to one or zero. Our results rely on the earlier proven ansatz which asserts that, for certain load balancing policies, the workloads of any finite set of queues become independent of one another as the number of servers tends to infinity.
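The three closed-form limits quoted in the abstract are easy to evaluate numerically. The following is a minimal Python sketch, not code from the paper: the function names (`limit_ll_d`, `limit_ll_d_k`, `limit_mixed`, `leading_order_wait`) are hypothetical, and the last helper simply multiplies a limit by $-\log(1-\lambda)$, which is only a leading-order heuristic justified by the fact that the ratio converges as $\lambda \to 1^-$. It says nothing about how accurate that product is at any particular load.

```python
import math


def limit_ll_d(d: int) -> float:
    """Limit 1/(d-1) stated in the abstract for LL(d); assumes d >= 2."""
    if d < 2:
        raise ValueError("LL(d) limit requires d >= 2")
    return 1.0 / (d - 1)


def limit_ll_d_k(d: int, k: int) -> float:
    """Limit K/(d-K) stated in the abstract for LL(d,K); assumes 1 <= K < d."""
    if not 1 <= k < d:
        raise ValueError("LL(d,K) limit requires 1 <= K < d")
    return k / (d - k)


def limit_mixed(probs, ds) -> float:
    """Limit 1/(sum_i p_i d_i - 1) for applying LL(d_i) with probability p_i."""
    if len(probs) != len(ds):
        raise ValueError("probs and ds must have the same length")
    if not math.isclose(sum(probs), 1.0):
        raise ValueError("probabilities must sum to 1")
    mean_d = sum(p * d for p, d in zip(probs, ds))
    if mean_d <= 1.0:
        raise ValueError("the mean number of sampled servers must exceed 1")
    return 1.0 / (mean_d - 1.0)


def leading_order_wait(limit: float, lam: float) -> float:
    """Heuristic (an assumption, not a result of the paper): limit * -log(1 - lam)."""
    if not 0.0 < lam < 1.0:
        raise ValueError("lam must lie strictly between 0 and 1")
    return -limit * math.log(1.0 - lam)


if __name__ == "__main__":
    print("LL(2):", limit_ll_d(2))
    print("LL(4,2):", limit_ll_d_k(4, 2))
    print("mix of LL(2) and LL(4):", limit_mixed([0.5, 0.5], [2, 4]))
    print("leading-order wait, LL(2), lam=0.99:",
          leading_order_wait(limit_ll_d(2), 0.99))
```

Two consistency checks follow directly from the formulas: LL(d,1) reduces to LL(d), and a mixed policy that puts all its probability on a single $d$ also reduces to LL(d).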

Journal: Proceedings of the ACM on Measurement and Analysis of Computing Systems
Publication Date: Jun 1, 2021
Citations: 3

Similar Papers

Mean Waiting Time in Large-Scale and Critically Loaded Power of d Load Balancing Systems
Tim Hellemans ... Benny Van Houdt
31 May 2021

Mean Waiting Time in Large-Scale and Critically Loaded Power of d Load Balancing Systems
Benny Van Houdt ... Tim Hellemans
ACM SIGMETRICS Performance Evaluation Review | VOL. 49
22 Jun 2021

Cluster load balancing for fine-grain network services
Lingkun Chu ... Kai Shen
01 Jan 2002

WACA: Workload and cache aware load balancing policy for web services
Raja Chiky ... Sylvain Lefebvre
01 Aug 2012
