This paper presents an algorithm which may be used to compute the distribution of the number of successes in a sequence of binary trials which displays dependency. The case of independent trials is treated even in elementary texts, and these results are frequently extended in the literature to the case of first-order Markovian dependence. However, results are scarce for higher-order Markovian dependence. In this article, an algorithm is developed for computing the distribution of the number of successes in binary sequences, under the assumption that the dependence structure is fourth-order Markovian. The importance of using the best model order for a particular data set is discussed. Examination of the computed distribution estimates for various orders may assist in model determination. Scope and purpose Many statistical applications may be modelled as a sequence of n dependent trials, with outcomes which may be classified as either a “success” or a “failure”. This paper presents an algorithm which may be used to compute the distribution of the number of successes in such sequences. It is assumed that the probability of success on the nth trial depends on the outcome of trials n−v, n−v+1, …, n−1 , for some finite value of v, but is independent of trials before n− v. The algorithm extends algorithms given by Kedem [1] for v=1 or 2 to the case where v=4. The importance of selecting an appropriate value for v when modelling dependent trials is discussed.
Read full abstract