Abstract

Suppose we are allowed to view successively as many terms as we please of a sequence Xi, X2, * * * of independent random variables with common distribution F. We can decide to stop viewing at any time and if we decide to stop at time n, we receive the payoff (Xi + * * * + X, )/n. How should we choose a stopping rule in order to maximize the expected payoff? This problem was introduced [5] in the context of inducing an illusory bias by selectively stopping an ESP experiment. Based on their general theory of optimal stopping rules, Y. S. Chow and Herbert Robbins [9] succeeded in proving that an optimal rule exists when F is a two point distribution. They also proved the intuitively obvious but nontrivial fact that the unique minimal optimal rule is to stop at the first n at which Xi + * * * + X. _ O ,, where ,B, B2, . . . is a sequence of numbers, and gave a way to calculate 0. in principle. Aryeh Dvoretzky [14], and also H. Teicher and J. Wolfowitz [25] then proved that the same results hold for any F with finite second moment (the ,B's depend on F, of course). Dvoretzky also showed that if F has zero mean and unit variance then 0.32 0 is sampled continuously and stopping at time t gets the payoff W (t)/ (a + t). Dvoretzky pointed out that if a > 0 there exists an optimal stopping time. We show that there is a unique optimal stopping time and we find it explicitly: it is the first time r that W (r) = a (a + r)I (the same a as above). The expected payoff under the optimal rule is also given explicitly (Theorem 1). Except for the constant a, the parabolic form of the boundary determining r is easily guessed by using the invariance of W under a change of scale. a can then be determined by using the principle of smooth fit, due to Herman Chernoff and others for various special problems and treated carefully and in some generality by B. I. Grigelionis and A. N. Shiryaev [18]. However, to prove the optimality of r rigorously we use a different approach, based on the fundamental Wald identity and on the work of Chow, Robbins, and Dvoretzky. The continuous time problem discussed above is basically similar to the familiar Wald sequential probability ratio problem where, again on heuristic considerations of homogeneity, the optimal stopping rule is given in terms of a pair of

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call