Abstract

We address the asymptotic and approximate distributions of a large class of test statistics with quadratic forms used in association studies. The statistics of interest take the general form D=X(T)A X, where A is a general similarity matrix which may or may not be positive semi-definite, and X follows the multivariate normal distribution with mean mu and variance matrix Sigma, where Sigma may or may not be singular. We show that D can be written as a linear combination of independent chi(2) random variables with a shift. Furthermore, its distribution can be approximated by a chi(2) or the difference of two chi(2) distributions. In the setting of association testing, our methods are especially useful in two situations. First, when the required significance level is much smaller than 0.05 such as in a genome scan, the estimation of p-values using permutation procedures can be challenging. Second, when an EM algorithm is required to infer haplotype frequencies from un-phased genotype data, the computation can be intensive for a permutation procedure. In either situation, an efficient and accurate estimation procedure would be useful. Our method can be applied to any quadratic form statistic and therefore should be of general interest.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call