Abstract

Let X 1, X 2,…X n be i.i.d. random variables with a known continuous distribution function. Robbins’ problem is to find a sequential stopping rule without recall which minimizes the expected rank of the selected observation. An upper bound (obtained by memoryless threshold rules) and a procedure to obtain lower bounds of the value are known, but the difficulty is that the optimal strategy depends for all n > 2 in an intractable way on the whole history of preceding observations. The goal of this article is to understand better the structure of both optimal memoryless threshold rules and the (overall) optimal rule. We prove that the optimal rule is a “stepwise” monotone increasing threshold-function rule and then study its property of, what we call, full history-dependence. For each n, we describe a tractable statistic of preceding observations which is sufficient for optimal decisions of decision makers with half-prophetical abilities who can do generally better than we. It is shown that their advice can always be used to improve strictly on memoryless rules, and we determine such an improved rule for all n sufficiently large. We do not know, however, whether one can construct, as n → ∞ asymptotically relevant improvements.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call