The Multi-Armed Bandit With Stochastic Plays

Antoine Lesage-Landry,Joshua A Taylor

doi:10.1109/tac.2017.2765501

The Multi-Armed Bandit With Stochastic Plays

Antoine Lesage-Landry, Joshua A Taylor

https://doi.org/10.1109/tac.2017.2765501

Copy DOI

Journal: IEEE Transactions on Automatic Control	Publication Date: Jul 1, 2018
Citations: 38

Affiliation: University of Toronto

#Demand Response In Power Systems #Demand Response + Show 7 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We extend the stochastic multi-armed bandit to the case where the number of arms to play evolves as a stationary process. Our work is motivated by demand response in power systems, in which the number of arms to play, or loads to dispatch, depends on a random power imbalance. We give an upper confidence bound-based algorithm that achieves sublinear pseudo-regret. We apply our results in several examples from demand response.

Full Text