Abstract

Abstract The effect of stratification and clustering on the asymptotic distributions of standard Pearson chi-squared test statistics for goodness of fit (simple hypothesis) and independence in a two-way contingency table, denoted as X 2 and XI 2, respectively, is investigated. It is shown that both X 2 and XI 2 are asymptotically distributed as weighted sums of independent χ1 2 random variables. The weights are then related to the familiar design effects (deffs) used by survey samplers. A simple correction to X 2, which requires only the knowledge of variance estimates (or deffs) for individual cells in the goodness-of-fit problem, is proposed and empirical results on the performance of corrected X 2 provided. Empirical work on XI 2 indicated that the distortion of nominal significance level is substantially smaller with XI 2 than with X 2. Some results under simple models for clustering are also given.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call