Multi-armed bandits in metric spaces

Robert Kleinberg,Aleksandrs Slivkins,Eli Upfal

doi:10.1145/1374376.1374475

Robert Kleinberg, Aleksandrs Slivkins + Show 1 more

Open Access

https://doi.org/10.1145/1374376.1374475

Copy DOI

Abstract

In a multi-armed bandit problem, an online algorithm chooses from a set of strategies in a sequence of $n$ trials so as to maximize the total payoff of the chosen strategies. While the performance of bandit algorithms with a small finite strategy set is quite well understood, bandit problems with large strategy sets are still a topic of very active investigation, motivated by practical applications such as online auctions and web advertisement. The goal of such research is to identify broad and natural classes of strategy sets and payoff functions which enable the design of efficient solutions. In this work we study a very general setting for the multi-armed bandit problem in which the strategies form a metric space, and the payoff function satisfies a condition with respect to the metric. We refer to this problem as the Lipschitz MAB problem. We present a complete solution for the multi-armed problem in this setting. That is, for every metric space (L,X) we define an isometry invariant Max Min COV(X) which bounds from below the performance of MAB algorithms for $X$, and we present an algorithm which comes arbitrarily close to meeting this bound. Furthermore, our technique gives even better results for benign payoff functions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-armed bandits in metric spaces

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Bandits and Experts in Metric Spaces
Robert Kleinberg ... Eli Upfal
Journal of the ACM | VOL. 66
Robert Kleinberg, et. al.Robert Kleinberg ... Eli Upfal
31 May 2019
Journal of the ACM | VOL. 66

Interactive strategy sets in multiple payoff games
Susan X Li
Computers & Industrial Engineering | VOL. 37
Susan X LiSusan X Li
01 Nov 1999
Computers & Industrial Engineering | VOL. 37

In-depth Exploration and Implementation of Multi-Armed Bandit Models Across Diverse Fields
Jiazhen Wu
Highlights in Science, Engineering and Technology | VOL. 94
Jiazhen WuJiazhen Wu
26 Apr 2024
Highlights in Science, Engineering and Technology | VOL. 94

An Optimal Algorithm for the Stochastic Bandits While Knowing the Near-Optimal Mean Reward.
Shangdong Yang ... Yang Gao
IEEE transactions on neural networks and learning systems | VOL. 32
Shangdong Yang, et. al.Shangdong Yang ... Yang Gao
01 May 2021
IEEE transactions on neural networks and learning systems | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-armed bandits in metric spaces

Abstract

Talk to us

Similar Papers