Asymptotically optimal algorithms for budgeted multiple play bandits

Alex Luedtke,Antoine Chambaz,Emilie Kaufmann

doi:10.1007/s10994-019-05799-x

Asymptotically optimal algorithms for budgeted multiple play bandits

Alex Luedtke, Antoine Chambaz + Show 1 more

Open Access

https://doi.org/10.1007/s10994-019-05799-x

Copy DOI

Journal: Machine Learning	Publication Date: May 16, 2019
Citations: 2

Affiliation: University of Washington, Seattle University, Délégation Paris 5, Université Paris Cité, Inria research centre Lille - Nord Europe, University of Lille, French National Centre for Scientific Research

#Decision Boundary #Asymptotic Regret + Show 7 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We study a generalization of the multi-armed bandit problem with multiple plays where there is a cost associated with pulling each arm and the agent has a budget at each time that dictates how much she can expect to spend. We derive an asymptotic regret lower bound for any uniformly efficient algorithm in our setting. We then study a variant of Thompson sampling for Bernoulli rewards and a variant of KL-UCB for both single-parameter exponential families and bounded, finitely supported rewards. We show these algorithms are asymptotically optimal, both in rate and leading problem-dependent constants, including in the thick margin setting where multiple arms fall on the decision boundary.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Machine Learning

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.