Abstract

With the rapid advance of information technology, network systems have become increasingly complex, and the underlying system dynamics are therefore typically unknown or difficult to characterize. Finding a good network control policy is of significant importance to achieving desirable network performance (e.g., high throughput or low average job delay). Online/sequential learning algorithms are well suited to learning the optimal control policy from observed data when the underlying dynamics are unknown. In this work, we use model-based reinforcement learning (RL) to learn the optimal control policy of queueing networks so that the average job delay (or, equivalently, the average queue backlog) is minimized. Existing RL techniques, however, cannot handle the unbounded state spaces of the network control problem. To overcome this difficulty, we propose a new algorithm, called Piecewise Decaying ε-Greedy Reinforcement Learning (PDGRL), which applies model-based RL methods over a finite subset of the state space. We establish that the average queue backlog under PDGRL with an appropriately constructed subset can be made arbitrarily close to the optimum. We evaluate PDGRL on dynamic server allocation and routing problems. Simulations show that PDGRL minimizes the average queue backlog effectively.
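The following is a minimal sketch of the high-level idea described in the abstract, not the paper's exact algorithm: ε-greedy exploration with a piecewise decaying schedule is applied only on a bounded subset of the otherwise unbounded queue-length state space, with a fallback policy outside that subset. The truncation threshold `U`, the ε schedule, the two-queue action set, and the longest-queue fallback are all illustrative assumptions.

```python
import random
from collections import defaultdict

U = 50            # assumed truncation threshold defining the finite subset {0, ..., U}^2
ACTIONS = [0, 1]  # e.g., which of two queues the server works on (illustrative)

# Value estimates for (state, action) pairs; a full model-based implementation
# would also maintain empirical transition counts and re-solve the estimated
# MDP restricted to the bounded subset.
value = defaultdict(float)

def epsilon(t, piece_length=1000):
    """Piecewise decaying exploration rate: constant within each piece of
    `piece_length` steps, decaying as 1/k across pieces (assumed schedule)."""
    return 1.0 / (1 + t // piece_length)

def greedy_action(state):
    """Exploit: pick the action currently estimated to be best."""
    return max(ACTIONS, key=lambda a: value[(state, a)])

def pdgrl_action(state, t):
    """Choose an action for queue-length vector `state` at time step `t`."""
    if max(state) > U:
        # Outside the finite subset: fall back to a simple stabilizing rule
        # (serve the longest queue) to keep the backlog bounded.
        return state.index(max(state))
    if random.random() < epsilon(t):
        return random.choice(ACTIONS)  # explore
    return greedy_action(state)        # exploit the learned model

# Example usage with a hypothetical two-queue state:
print(pdgrl_action((3, 7), t=2500))
```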
