Self-Learning Threshold-Based Load Balancing

Diego Goldsztajn,Johan S. H. van Leeuwaarden,Philip A. Whiting,Sem C. Borst,Debankur Mukherjee

doi:10.1287/ijoc.2021.1100

Abstract

We consider a large-scale service system where incoming tasks have to be instantaneously dispatched to one out of many parallel server pools. The user-perceived performance degrades with the number of concurrent tasks and the dispatcher aims at maximizing the overall quality of service by balancing the load through a simple threshold policy. We demonstrate that such a policy is optimal on the fluid and diffusion scales, while only involving a small communication overhead, which is crucial for large-scale deployments. In order to set the threshold optimally, it is important, however, to learn the load of the system, which may be unknown. For that purpose, we design a control rule for tuning the threshold in an online manner. We derive conditions that guarantee that this adaptive threshold settles at the optimal value, along with estimates for the time until this happens. In addition, we provide numerical experiments that support the theoretical results and further indicate that our policy copes effectively with time-varying demand patterns. Summary of Contribution: Data centers and cloud computing platforms are the digital factories of the world, and managing resources and workloads in these systems involves operations research challenges of an unprecedented scale. Due to the massive size, complex dynamics, and wide range of time scales, the design and implementation of optimal resource-allocation strategies is prohibitively demanding from a computation and communication perspective. These resource-allocation strategies are essential for certain interactive applications, for which the available computing resources need to be distributed optimally among users in order to provide the best overall experienced performance. This is the subject of the present article, which considers the problem of distributing tasks among the various server pools of a large-scale service system, with the objective of optimizing the overall quality of service provided to users. A solution to this load-balancing problem cannot rely on maintaining complete state information at the gateway of the system, since this is computationally unfeasible, due to the magnitude and complexity of modern data centers and cloud computing platforms. Therefore, we examine a computationally light load-balancing algorithm that is yet asymptotically optimal in a regime where the size of the system approaches infinity. The analysis is based on a Markovian stochastic model, which is studied through fluid and diffusion limits in the aforementioned large-scale regime. The article analyzes the load-balancing algorithm theoretically and provides numerical experiments that support and extend the theoretical results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: INFORMS Journal on Computing	Publication Date: Sep 16, 2021
Citations: 1	License type: other-oa

R Discovery Prime

R Discovery Prime

Self-Learning Threshold-Based Load Balancing

Abstract

Talk to us

Similar Papers

More From: INFORMS Journal on Computing

Lead the way for us

Similar Papers

Application of Viral System Algorithm in Load Balancing of Cloud Environment
Damodar Tiwari ... Sanjeev Sharma
Journal of Computer Science | VOL. 14
Damodar Tiwari, et. al.Damodar Tiwari ... Sanjeev Sharma
01 Jul 2018
Journal of Computer Science | VOL. 14

Tightness of invariant distributions of a large-scale flexible service system under a priority discipline
Alexander L Stolyar ... Elena Yudovina
Stochastic Systems | VOL. 2
Alexander L Stolyar, et. al.Alexander L Stolyar ... Elena Yudovina
01 Jan 2012
Stochastic Systems | VOL. 2

Tightness of Invariant Distributions of a Large-scale Flexible Service System Under a Priority Discipline
Alexander L Stolyar ... Elena Yudovina
Stochastic Systems | VOL. 2
Alexander L Stolyar, et. al.Alexander L Stolyar ... Elena Yudovina
01 Dec 2012
Stochastic Systems | VOL. 2

Efficient load balancing in large-scale systems
D Mukherjee ... S.C Borst
-
D Mukherjee, et. al.D Mukherjee ... S.C Borst
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Self-Learning Threshold-Based Load Balancing

Abstract

Talk to us

Similar Papers

More From: INFORMS Journal on Computing