Abstract

To the Editor:The standard permutation approach cannot be applied to two-stage association studies, because a marker that was not originally selected for the second stage of the study may be selected after permutation. To get around this difficulty, Frank Dudbridge proposes1xA note on permutation tests in multistage association scans. Dudbridge, F. Am J Hum Genet. 2006; 78: 1094–1095Abstract | Full Text | Full Text PDF | PubMed | Scopus (19)See all References1 (in this issue) to simulate a two-stage design by using only the first-stage subjects. This is a very clever idea and seems to be in a spirit similar to my Monte Carlo method,2xEvaluating statistical significance in two-stage genomewide association studies. Lin, DY. Am J Hum Genet. 2006; 78: 505–509Abstract | Full Text | Full Text PDF | PubMed | Scopus (45)See all References2 in that both methods use only the data from the first-stage subjects to estimate the correlations of the test statistics. I believe that Dudbridge’s permutation method (implicitly) requires that the same design (in terms of the proportion of subjects used in the first stage) be adopted in the permutation process as in the original study; otherwise, the joint distribution between the two stages obtained by permutation will not properly reflect the true joint distribution.I wish to respond briefly to Dudbridge’s comment that my Monte Carlo method “is limited to analysis based on efficient score functions and does not use permutation.”1xA note on permutation tests in multistage association scans. Dudbridge, F. Am J Hum Genet. 2006; 78: 1094–1095Abstract | Full Text | Full Text PDF | PubMed | Scopus (19)See all References1 As mentioned in my report,2xEvaluating statistical significance in two-stage genomewide association studies. Lin, DY. Am J Hum Genet. 2006; 78: 505–509Abstract | Full Text | Full Text PDF | PubMed | Scopus (45)See all References2 all test statistics can be represented by efficient score functions. Thus, the use of efficient score functions in generating the null distribution of the test statistics does not, in any way, limit the scope of application. As discussed in an earlier article,3xAn efficient Monte Carlo approach to assessing statistical significance in genomic studies. Lin, DY. Bioinformatics. 2005; 21: 781–787Crossref | PubMed | Scopus (106)See all References3 the Monte Carlo approach has important advantages over the permutation approach. First, the permutation approach requires repeated calculations of the test statistics for each permuted data set, which can be prohibitively time consuming if the calculation of each test statistic is nontrivial, as will be the case if proper statistical methods are employed to test haplotype-disease associations,4xMaximum likelihood estimation of haplotype effects and haplotype-environment interactions in association studies. Lin, DY, Zeng, D, and Millikan, R. Genet Epidemiol. 2005; 29: 299–312Crossref | PubMed | Scopus (89)See all References4 whereas the Monte Carlo approach involves simulation of normal random variables only and is thus very efficient. Second, the permutation method can be used only to test the global null hypothesis that the variable being permuted is independent of all other variables and cannot be used to test, for example, gene-environment interactions, whereas the Monte Carlo approach can be used to test any kind of hypothesis.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call