Index-based policies for discounted multi-armed bandits on parallel machines

K D Glazebrook,D J Wilkinson

doi:10.1214/aoap/1019487512

Index-based policies for discounted multi-armed bandits on parallel machines

K D Glazebrook, D J Wilkinson

Open Access

https://doi.org/10.1214/aoap/1019487512

Copy DOI

Journal: The Annals of Applied Probability	Publication Date: Aug 1, 2000
Citations: 16

#Parallel Machines #Limit Policies + Show 3 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We utilize and develop elements of the recent achievable region account of Gittins indexation by Bertsimas and Niño-Mora to design index-based policies for discounted multi-armed bandits on parallel machines. The policies analyzed have expected rewards which come within an $O(\alpha)$ quantity of optimality, where $\alpha > 0$ is a discount rate. In the main, the policies make an initial once for all allocation of bandits to machines, with each machine then handling its own workload optimally. This allocation must take careful account of the index structure of the bandits. The corresponding limit policies are average-overtaking optimal.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: The Annals of Applied Probability

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.