Abstract
The Distributed Multi-Armed Bandit (DMAB) is a powerful framework for studying many network problems. The DMAB is typically studied under a paradigm in which signals activate each agent with a fixed probability, and the rewards revealed to agents are assumed either to be generated from fixed and unknown distributions, i.e., stochastic rewards, or to be arbitrarily manipulated by an adversary, i.e., adversarial rewards. However, this paradigm fails to capture the dynamics and uncertainties of many real-world applications, where the signal that activates an agent may not follow any distribution, and the rewards might be partially stochastic and partially adversarial. Motivated by this, we study the asynchronous stochastic DMAB problem with adversarial corruptions, where each agent is activated arbitrarily and rewards initially sampled from distributions might be corrupted by an adversary. The objectives are to simultaneously minimize the regret and the communication cost while remaining robust to corruption. To address all these issues, we propose a Robust and Distributed Active Arm Elimination algorithm, named RDAAE, which transmits only one real number (e.g., an arm index or a reward) per communication. We theoretically prove that regret and communication cost degrade smoothly as the corruption level increases.
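The abstract only names the algorithm, so the following is a minimal single-agent sketch of the general active-arm-elimination idea with a corruption-widened confidence radius; it is not the paper's RDAAE, and the function names, the Hoeffding-style radius, and the `corruption_budget` slack term are illustrative assumptions.

```python
import numpy as np

def robust_active_arm_elimination(reward_fn, n_arms, horizon, corruption_budget=0.0, seed=0):
    """Illustrative sketch: round-robin exploration over a set of active arms,
    eliminating arms whose upper confidence bound falls below the best lower
    confidence bound. The radius is widened by a per-pull share of an assumed
    total corruption budget, so elimination stays valid under bounded corruption."""
    rng = np.random.default_rng(seed)
    active = list(range(n_arms))      # arms not yet eliminated
    pulls = np.zeros(n_arms)          # number of times each arm was pulled
    means = np.zeros(n_arms)          # empirical mean reward per arm
    t = 0
    while t < horizon and len(active) > 1:
        # Pull every still-active arm once (round-robin exploration).
        for a in list(active):
            r = reward_fn(a, rng)
            pulls[a] += 1
            means[a] += (r - means[a]) / pulls[a]
            t += 1
            if t >= horizon:
                break
        # Hoeffding-style radius plus extra slack absorbing the corruption budget.
        radius = np.sqrt(2.0 * np.log(max(t, 2)) / np.maximum(pulls, 1)) \
                 + corruption_budget / np.maximum(pulls, 1)
        best_lcb = max(means[a] - radius[a] for a in active)
        active = [a for a in active if means[a] + radius[a] >= best_lcb]
    return active, means

# Example usage: Bernoulli arms with means 0.3/0.5/0.7 and no corruption.
if __name__ == "__main__":
    def bernoulli_rewards(arm, rng, p=(0.3, 0.5, 0.7)):
        return float(rng.random() < p[arm])

    surviving, estimates = robust_active_arm_elimination(
        bernoulli_rewards, n_arms=3, horizon=3000, corruption_budget=0.0)
    print("surviving arms:", surviving, "estimated means:", np.round(estimates, 3))
```

In a distributed setting such as the one the abstract describes, each agent would maintain such statistics locally and exchange only a single real number per communication, but those protocol details are beyond what the abstract specifies.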