A confirmation of a conjecture on Feldman’s two-armed bandit problem

Jichen Zhang,Zengjing Chen,Yiwei Lin

doi:10.1017/jpr.2023.24

A confirmation of a conjecture on Feldman’s two-armed bandit problem

Jichen Zhang, Zengjing Chen

Open Access

https://doi.org/10.1017/jpr.2023.24

Copy DOI

Journal: Journal of Applied Probability

Publication Date: May 26, 2023

Affiliation: Shandong University

#General Utility Function #Two-armed Bandit Problem + Show 7 more

Abstract
Full-Text PDF
Similar Papers

Abstract

AbstractThe myopic strategy is one of the most important strategies when studying bandit problems. In 2018, Nouiehed and Ross put forward a conjecture about Feldman’s bandit problem (J. Appl. Prob. (2018) 55, 318–324). They proposed that for Bernoulli two-armed bandit problems, the myopic strategy stochastically maximizes the number of wins. In this paper we consider the two-armed bandit problem with more general distributions and utility functions. We confirm this conjecture by proving a stronger result: if the agent playing the bandit has a general utility function, the myopic strategy is still optimal if and only if this utility function satisfies reasonable conditions.

Full Text