SUMMARY Second-order asymptotic properties of the jackknife procedure are discussed, and the jackknifed estimator is shown to be a vulnerable estimator whose variation can be severely underestimated by the jackknife standard error. Simple robust alternatives to the average pseudovalue are discussed. Particular emphasis is placed on estimation of a correlation coefficient. Numerical examples are given. The jackknife is a method for distribution-free bias reduction and standard error estimation. For a wide class of problems it is known that the jackknife produces consistent results. An excellent review of applications and asymptotic theory is given by Miller (1974). Recently there have been several investigations of small-sample properties of the jackknife procedure (Hin-kley, 1977a, b), which show that some adjustments are necessary in order to obtain accurate confidence intervals using the jackknife. In an unpublished paper, B. Efron ha? shown that the jackknife gives a rough linear approximation to another subsampling method for getting confidence intervals. In the present paper we examine two further aspects of the jackknife, namely the use ol second-order asymptotics in assessing finite-sample properties, and the use of jackknife pseudovalues in obtaining estimates less sensitive to extreme data points. The discussion is illustrated throughout with results for the correlation estimate. A brief summary of the results is as follows: jackknifed estimators can have very large haphazard bias compared to the original estimators; the jackknife estimate of standard error can severely underestimate the standard error of the jackknifed estimator; and use of the jackknife pseudovalues in residual and trimmed-mean analyses can give considerably improved estimators. Section 2 summarizes the standard jackknife method and illustrates it on an artificial data set, where certain difficulties are apparent. Second-order properties of the jackknife are derived in ? 3, and numerical results are given for the correlation example. The same example is used in ? 4, where robust analysis via pseudovalues is discussed. Section 5 gives brief conclusions. Throughout the paper we assume that the basic estimate is obtained from independent, identically distributed random variables. Moreover we assume that the estimate is a regular differentiable functional of the empirical distribution function, with at least two derivatives.
Read full abstract