Decentralized Scheduling with QoS Constraints: Achieving O(1) QoS Regret of Multi-Player Bandits

Qingsong Liu,Zhixuan Fang

doi:10.1609/aaai.v38i12.29306

Abstract

We consider a decentralized multi-player multi-armed bandit (MP-MAB) problem where players cannot observe the actions and rewards of other players and no explicit communication or coordination between players is possible. Prior studies mostly focus on maximizing the sum of rewards of the players over time. However, the total reward maximization learning may lead to imbalanced reward among players, leading to poor Quality of Service (QoS) for some players. In contrast, our objective is to let each player n achieve a predetermined expected average reward over time, i.e., achieving a predetermined level of QoS. We develop a novel decentralized MP-MAB algorithm to accomplish this objective by leveraging the methodology of randomized matching. We prove that our decentralized algorithm can ensure that all players have an O(1) QoS regret. We also reveal an analog between our MP-MAB model and the online wireless queuing systems, which builds a connection between QoS in MP-MAB learning and stability in queuing theory.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Decentralized Scheduling with QoS Constraints: Achieving O(1) QoS Regret of Multi-Player Bandits

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Mar 24, 2024
Citations: 1

Similar Papers

Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits
Guojun Xiong ... Jian Li
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence | VOL. 37
Guojun Xiong, et. al.Guojun Xiong ... Jian Li
26 Jun 2023
Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence | VOL. 37

Opportunistic Scheduling with Multiple QoS Constraints in Wireless Multiservice Networks
Dan Liao ... Hongfang Yu
-
Dan Liao, et. al.Dan Liao ... Hongfang Yu
01 Jan 2007
01 Jan 2007

Cognitive radio transmission under QoS constraints and interference limitations
Sami Akin ... Mustafa Cenk Gursoy
EURASIP journal on wireless communications and networking | VOL. 2012
Sami Akin, et. al.Sami Akin ... Mustafa Cenk Gursoy
24 Sep 2012
EURASIP journal on wireless communications and networking | VOL. 2012

On the Achievable Throughput Region of Multiple-Access Fading Channels with QoS Constraints
D Qiao ... M C Gursoy
-
D Qiao, et. al.D Qiao ... M C Gursoy
01 May 2010
01 May 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decentralized Scheduling with QoS Constraints: Achieving O(1) QoS Regret of Multi-Player Bandits

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence