Mistake bounds on the noise-free multi-armed bandit game

Atsuyoshi Nakamura,David P Helmbold,Manfred K Warmuth

doi:10.1016/j.ic.2019.104453

Mistake bounds on the noise-free multi-armed bandit game

Atsuyoshi Nakamura, David P Helmbold + Show 1 more

Open Access

https://doi.org/10.1016/j.ic.2019.104453

Copy DOI

Journal: Information and Control	Publication Date: Aug 27, 2019
Citations: 1	License type: publisher-specific-oa

Affiliation: Hokkaido University, University of California, Santa Cruz

#Number Of Arms #Multi-armed Bandit + Show 6 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We study the {0,1}-loss version of adaptive adversarial multi-armed bandit problems with α(≥1) lossless arms. For the problem, we show a tight bound K−α−Θ(1/T) on the minimax expected number of mistakes (1-losses), where K is the number of arms and T is the number of rounds.

Full Text