Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits

Thibaut Cuvelier,Eric Gourdin,Richard Combes

doi:10.1145/3447387

Abstract

We consider combinatorial semi-bandits over a set of arms X \subset \0,1\ ^d where rewards are uncorrelated across items. For this problem, the algorithm ESCB yields the smallest known regret bound R(T) = O( d (łn m)^2 (łn T) / Δ_\min ) after T rounds, where m = \max_x \in X 1^\top x. However, ESCB it has computational complexity O(|X|), which is typically exponential in d, and cannot be used in large dimensions. We propose the first algorithm that is both computationally and statistically efficient for this problem with regret R(T) = O( d (łn m)^2 (łn T) / Δ_\min ) and computational asymptotic complexity O(δ_T^-1 poly(d)), where δ_T is a function which vanishes arbitrarily slowly. Our approach involves carefully designing AESCB, an approximate version of ESCB with the same regret guarantees. We show that, whenever budgeted linear maximization over X can be solved up to a given approximation ratio, AESCB is implementable in polynomial time O(δ_T^-1 poly(d)) by repeatedly maximizing a linear function over X subject to a linear budget constraint, and showing how to solve these maximization problems efficiently.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Measurement and Analysis of Computing Systems

Lead the way for us

Journal: Proceedings of the ACM on Measurement and Analysis of Computing Systems	Publication Date: Feb 18, 2021
Citations: 2

Similar Papers

Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits
Thibaut Cuvelier ... Eric Gourdin
ACM SIGMETRICS Performance Evaluation Review | VOL. 49
Thibaut Cuvelier, et. al.Thibaut Cuvelier ... Eric Gourdin
22 Jun 2021
ACM SIGMETRICS Performance Evaluation Review | VOL. 49

Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits
Thibaut Cuvelier ... Eric Gourdin
-
Thibaut Cuvelier, et. al.Thibaut Cuvelier ... Eric Gourdin
31 May 2021
31 May 2021

An efficient QoS routing algorithm for solving MCP in ad hoc networks
Noureddine Kettaf ... Hafid Abouaissa
Telecommunication Systems | VOL. 33
Noureddine Kettaf, et. al.Noureddine Kettaf ... Hafid Abouaissa
12 Oct 2006
Telecommunication Systems | VOL. 33

Deterministic Approximation Algorithms for the Nearest Codeword Problem
Noga Alon ... Rina Panigrahy
-
Noga Alon, et. al.Noga Alon ... Rina Panigrahy
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Measurement and Analysis of Computing Systems