The multi-armed bandit, with constraints

Eric V. Denardo,Uriel G. Rothblum,Eugene A. Feinberg

doi:10.1007/s10479-012-1250-y

The multi-armed bandit, with constraints

Eric V. Denardo, Uriel G. Rothblum + Show 1 more

Open Access

https://doi.org/10.1007/s10479-012-1250-y

Copy DOI

Journal: Annals of Operations Research	Publication Date: Nov 13, 2012
Citations: 26

Affiliation: Center for Systems Biology, Yale University, Technion – Israel Institute of Technology, Stony Brook University

#Multi-armed Bandit #Exponential Utility + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Presented in this paper is a self-contained analysis of a Markov decision problem that is known as the multi-armed bandit. The analysis covers the cases of linear and exponential utility functions. The optimal policy is shown to have a simple and easily-implemented form. Procedures for computing such a policy are presented, as are procedures for computing the expected utility that it earns, given any starting state. For the case of linear utility, constraints that link the bandits are introduced, and the constrained optimization problem is solved via column generation. The methodology is novel in several respects, which include the use of elementary row operations to simplify arguments.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Annals of Operations Research

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.