Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme

Konstantinos E Nikolakakis,Anand D Sarwate,Dionysios S Kalogerias,Or Sheffet

doi:10.1109/jsait.2021.3081525

Abstract

We study the best-arm identification problem in multi-armed bandits with stochastic rewards when the goal is to identify the arm with the highest quantile at a fixed, prescribed level. First, we propose a successive elimination algorithm for strictly optimal best-arm identification, show that it is δ-PAC and characterize its sample complexity. Further, we provide a lower bound on the expected number of pulls, showing that the proposed algorithm is essentially optimal up to logarithmic factors. Both upper and lower complexity bounds depend on a special definition of the associated suboptimality gap, designed in particular for the quantile bandit problem - as we show, when the gap approaches zero, best-arm identification is impossible. Second, motivated by applications where the rewards are private information, we provide a differentially private successive elimination algorithm whose sample complexity is finite even for distributions with infinite support and characterize its sample complexity. Our algorithms do not require prior knowledge of either the suboptimality gap or other statistical information related to the bandit problem at hand.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Information Theory

Lead the way for us

Journal: IEEE Journal on Selected Areas in Information Theory	Publication Date: Jun 1, 2021
Citations: 4

Similar Papers

A day at the races
David E Losada ... David Elsweiler
Applied Intelligence | VOL. 52
David E Losada, et. al.David E Losada ... David Elsweiler
17 Aug 2021
Applied Intelligence | VOL. 52

Best Arm Identification under Additive Transfer Bandits
Ojash Neopane ... Aaditya Ramdas
-
Ojash Neopane, et. al.Ojash Neopane ... Aaditya Ramdas
31 Oct 2021
31 Oct 2021

Best Arm Identification in Spectral Bandits
Tomáš Kocák ... Aurélien Garivier
-
Tomáš Kocák, et. al.Tomáš Kocák ... Aurélien Garivier
01 Jul 2020
01 Jul 2020

Best-Arm Identification in Correlated Multi-Armed Bandits
Samarth Gupta ... Gauri Joshi
IEEE Journal on Selected Areas in Information Theory | VOL. 2
Samarth Gupta, et. al.Samarth Gupta ... Gauri Joshi
01 Jun 2021
IEEE Journal on Selected Areas in Information Theory | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme

Abstract

Talk to us

Similar Papers

More From: IEEE Journal on Selected Areas in Information Theory