Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

Guojun Xiong,Jian Li

doi:10.1609/aaai.v37i9.26251

Abstract

Multi-player multi-armed bandit is an increasingly relevant decision-making problem, motivated by applications to cognitive radio systems. Most research for this problem focuses exclusively on the settings that players have full access to all arms and receive no reward when pulling the same arm. Hence all players solve the same bandit problem with the goal of maximizing their cumulative reward. However, these settings neglect several important factors in many real-world applications, where players have limited access to a dynamic local subset of arms (i.e., an arm could sometimes be ``walking'' and not accessible to the player). To this end, this paper proposes a multi-player multi-armed walking bandits model, aiming to address aforementioned modeling issues. The goal now is to maximize the reward, however, players can only pull arms from the local subset and only collect a full reward if no other players pull the same arm. We adopt Upper Confidence Bound (UCB) to deal with the exploration-exploitation tradeoff and employ distributed optimization techniques to properly handle collisions. By carefully integrating these two techniques, we propose a decentralized algorithm with near-optimal guarantee on the regret, and can be easily implemented to obtain competitive empirical performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Gateway Selection in Millimeter Wave UAV Wireless Networks Using Multi-Player Multi-Armed Bandit.
Ehab Mahmoud Mohamed ... Mahmoud Ahmed Abdelghany
Sensors | VOL. 20
Ehab Mahmoud Mohamed, et. al.Ehab Mahmoud Mohamed ... Mahmoud Ahmed Abdelghany
16 Jul 2020
Sensors | VOL. 20

A Practical Multiplayer Multi-armed bandit Algorithm for Smart City Communication System
Shubhjeet Kumar Tiwari ... Sudhanshu Soni
-
Shubhjeet Kumar Tiwari, et. al.Shubhjeet Kumar Tiwari ... Sudhanshu Soni
19 Mar 2021
19 Mar 2021

Multi-Player Multi-Armed Bandits with Finite Shareable Resources Arms: Learning Algorithms & Applications
Xuchuang Wang ... John C S Lui
-
Xuchuang Wang, et. al.Xuchuang Wang ... John C S Lui
01 Jul 2022
01 Jul 2022

Distributed Online Learning for Coexistence in Cognitive Radar Networks
William W Howard ... Anthony F Martone
IEEE Transactions on Aerospace and Electronic Systems | VOL. -
William W Howard, et. al.William W Howard ... Anthony F Martone
01 Jan 2021
IEEE Transactions on Aerospace and Electronic Systems | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Decentralized Stochastic Multi-Player Multi-Armed Walking Bandits

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence