Ballooning Multi Armed Bandits

Sujit Gujar ,Ganesh Ghalme ,Yadati Narahari ,Swapnil Dhamal ,Shweta Jain

doi:10.48448/b7ck-r445

Abstract

We introduce ballooning multi-armed bandits (BL-MAB), a novel extension to the classical stochastic MAB model. In the BL-MAB model, the set of available arms grows (or balloons) over time. The regret in a BL-MAB setting is computed with respect to the best available arm at each time. We first observe that the existing stochastic MAB algorithms are not regret-optimal for the BL-MAB model. We show that if the best arm is equally likely to arrive at any time, a sub-linear regret cannot be achieved, irrespective of the arrival of the other arms. We further show that if the best arm is more likely to arrive in the early rounds, one can achieve sub-linear regret. Making reasonable assumptions on the arrival distribution of the best arm in terms of the thinness of the distribution’s tail, we prove that the proposed algorithm achieves sub-linear instance-independent regret. We further quantify explicit dependence of regret on the arrival distribution parameters.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ballooning Multi Armed Bandits

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Ballooning multi-armed bandits
Ganesh Ghalme ... Y Narahari
Artificial Intelligence | VOL. 296
Ganesh Ghalme, et. al.Ganesh Ghalme ... Y Narahari
24 Feb 2021
Artificial Intelligence | VOL. 296

Evidence for universality within the classes of deterministic and stochastic sandpile models.
Ofer Biham ... Ofer Malcai
Physical review. E, Statistical, nonlinear, and soft matter physics | VOL. 63
Ofer Biham, et. al.Ofer Biham ... Ofer Malcai
23 May 2001
Physical review. E, Statistical, nonlinear, and soft matter physics | VOL. 63

Integral Transforms of Distribution Densities
Archil Gulisashvili
-
Archil GulisashviliArchil Gulisashvili
01 Jan 2012
01 Jan 2012

A novel ex-post truthful mechanism for multi-slot sponsored search auctions
...
-
, et. al. ...
05 May 2014
05 May 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ballooning Multi Armed Bandits

Abstract

Talk to us

Similar Papers