Abstract

BackgroundStudies of association methods using DNA pooling of single nucleotide polymorphisms (SNPs) have focused primarily on the effects of "machine-error", number of replicates, and the size of the pool. We use the non-centrality parameter (NCP) for the analysis of variance test to compute the approximate power for genetic association tests with DNA pooling data on cases and controls. We incorporate genetic model parameters into the computation of the NCP. Parameters involved in the power calculation are disease allele frequency, frequency of the marker SNP allele in coupling with the disease locus, disease prevalence, genotype relative risk, sample size, genetic model, number of pools, number of replicates of each pool, and the proportion of variance of the pooled frequency estimate due to machine variability. We compute power for different settings of number of replicates and total number of genotypings when the genetic model parameters are fixed. Several significance levels are considered, including stringent significance levels (due to the increasing popularity of 100 K and 500 K SNP "chip" data). We use a factorial design with two to four settings of each parameter and multiple regression analysis to assess which parameters most significantly affect power.ResultsThe power can increase substantially as the genotyping number increases. For a fixed number of genotypings, the power is a function of the number of replicates of each pool such that there is a setting with maximum power. The four most significant parameters affecting power for association are: (1) genotype relative risk, (2) genetic model, (3) sample size, and (4) the interaction term between disease and SNP marker allele probabilities.ConclusionFor a fixed number of genotypings, there is an optimal number of replicates of each pool that increases as the number of genotypings increases. Power is not substantially reduced when the number of replicates is close to but not equal to the optimal setting.

Highlights

  • Studies of association methods using DNA pooling of single nucleotide polymorphisms (SNPs) have focused primarily on the effects of "machine-error", number of replicates, and the size of the pool

  • With the recent development of technologies that can determine the genotypes for hundreds of thousands of single nucleotide polymorphisms (SNPs) across the human genome, such studies are being reported in the literature [1,2,3]

  • The research question addressed in this work is: assuming a certain level of measurement error, what settings of study design parameters maximize the power to detect association? we study the sensitivity of power to changes in design parameters

Read more

Summary

Introduction

Studies of association methods using DNA pooling of single nucleotide polymorphisms (SNPs) have focused primarily on the effects of "machine-error", number of replicates, and the size of the pool. We use the non-centrality parameter (NCP) for the analysis of variance test to compute the approximate power for genetic association tests with DNA pooling data on cases and controls. With the recent development of technologies that can determine the genotypes for hundreds of thousands of single nucleotide polymorphisms (SNPs) across the human genome, such studies are being reported in the literature [1,2,3] Design issues such as power to detect association using these technologies are being published [4,5]. A key assumption is that the intensity measure has an expected value equal to the allele frequency Another potential disadvantage is that DNA pooling techniques may not detect disease mode of inheritances that deviate from dominant or recessive modes. DNA pooling techniques will be underpowered to detect disease genes that operate in an over-dominant form

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call