Abstract

BackgroundIn the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies. Misclassification also can bias population frequency parameters such as genotype, haplotype, or multi-locus genotype frequencies. These problems are of particular concern in case/control designs because, short of repeated sampling, there is no way to detect misclassification errors.We developed a double-sampling procedure for case/control genetic association using a likelihood ratio test framework. Different approaches have been proposed to deal with misclassification errors. We have chosen the likelihood framework because of the ease with which misclassification probabilities may be incorporated into in the statistical framework and hypothesis testing. The statistic is called the Likelihood Ratio Test allowing for errors (LRTae) and is freely available via software download.ResultsWe applied our procedure to 10,000 replicates of simulated case/control data in which we introduced phenotype misclassification errors. The phenotype considered is Ankylosing Spondylitis (AS). The LRTae method power was always greater than LRTstd power for the significance levels considered (5%, 1%, 0.1%, 0.01%). Power gains for the LRTae method over the LRTstd method increased as the significance level became more stringent. Multi-locus genotype frequency estimates using LRTae method were more accurate than estimates using LRTstd method.ConclusionThe LRTae method can be applied to single-locus genotypes, multi-locus genotypes, or multi-locus haplotypes in a case/control framework and can be more powerful to detect association in case/control studies when both genotype and/or phenotype errors are present. Furthermore, the LRTae method provides asymptotically unbiased estimates of case and control genotype frequencies, as well as estimates of phenotype and/or genotype misclassification rates.

Highlights

  • In the field of statistical genetics, phenotype and genotype misclassification errors can substantially reduce power to detect association with genetic case/control studies

  • We applied our procedure to 10,000 replicates of simulated case/control data in which we introduced phenotype misclassification errors

  • The Likelihood Ratio Test allowing for errors (LRTae) method can be applied to single-locus genotypes, multi-locus genotypes, or multi-locus haplotypes in a case/control framework and can be more powerful to detect association in case/control studies when both genotype and/or phenotype errors are present

Read more

Summary

Results

We simulated 10,000 replicates of case/control genetic association data to evaluate power and estimation of multi-locus genotypes frequencies for the LRTae and LRTstd (the likelihood ratio test that does not include doublesample information) methods. The code "3 : 3 : 3" is the multi-locus genotype consisting of the more common homozygote at each of the three SNP loci Note that these generating frequencies suggest a recessive mode of inheritance for AS in our simulations, since greatest risk occurs for individuals who are homozygous for each of the three SNPs (code 3 : 3 : 3). We comment that the LRTae mean estimates of multilocus genotype frequencies in cases and controls and misclassification error rates were very accurate (data not shown) with relatively small variances

Conclusion
Background
Gordon D
13. Bross I
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call