A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts

Minghui Wang,Zewei Luo,Ning Jiang,Lin Wang,Tianye Jia

doi:10.1186/1471-2164-14-88

Abstract

BackgroundThe theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors and deliver a poor statistical power for detecting genuine associations and also a high false positive rate. Here, we present a likelihood-based statistical approach that accounts properly for non-random nature of case–control samples in regard of genotypic distribution at the loci in populations under study and confers flexibility to test for genetic association in presence of different confounding factors such as population structure, non-randomness of samples etc.ResultsWe implemented this novel method together with several popular methods in the literature of GWAS, to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulation show that the new method confers not only significantly improved statistical power for detecting the associations but also robustness to the difficulties stemmed from non-randomly sampling and genetic structures when compared to its rivals. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb but only 6 SNPs in two of these regions were previously detected by the trend test based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates, FGF20 and PARK8, without invoking false positive risk.ConclusionsWe developed a novel likelihood-based method which provides adequate estimation of LD and other population model parameters by using case and control samples, the ease in integration of these samples from multiple genetically divergent populations and thus confers statistically robust and powerful analyses of GWAS. On basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most popularly implemented in the literature of GWAS.

Highlights

The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus
It can be seen from the stage I data analysis (Figure 1a) that Method 1 developed in the present study detected 44 significant SNPs, which are distributed across 25 chromosomal regions of size < 1 Mb (Table 2)
No extra significant association was detected by Method 2 or 3 outside the 25 regions screened by Method 1

Summary

Introduction

The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. The theoretical kernel of these genetic association studies is statistical inference of linkage disequilibrium (LD) between a tested polymorphic marker locus and a putative trait locus in the population of interest. Rarely account for the biases described above [9] Such approaches are statistically less sophisticated and often not robust in the presence of these influences, exposing the corresponding analyses to the risk of false positive and/or negative inferences of genetic association. We present here a novel likelihood-based statistical framework that confers improved robustness in estimation of model parameters to non-randomness of samples and a more powerful statistical test of LD in the presence or the absence of genetic structure. We illustrate the statistical properties of the method by computer simulation study

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Genomics	Publication Date: Feb 8, 2013
Citations: 32	License type: cc-by

R Discovery Prime

R Discovery Prime

A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

ATRIUM: Testing Untyped SNPs in Case-Control Association Studies with Related Individuals
Zuoheng Wang ... Mary Sara Mcpeek
American Journal of Human Genetics | VOL. 85
Zuoheng Wang, et. al.Zuoheng Wang ... Mary Sara Mcpeek
01 Nov 2009
American Journal of Human Genetics | VOL. 85

Impact on venous thrombosis risk of newly discovered gene variants associated with FVIII and VWF plasma levels
P.‐E Morange ... D.‐A Trégouët
Journal of thrombosis and haemostasis : JTH | VOL. 9
P.‐E Morange, et. al.P.‐E Morange ... D.‐A Trégouët
01 Jan 2010
Journal of thrombosis and haemostasis : JTH | VOL. 9

Considerations for Genomewide Association Studies in Parkinson Disease
Dr Richard H Myers
American Journal of Human Genetics | VOL. 78
Dr Richard H MyersDr Richard H Myers
01 Jun 2006
American Journal of Human Genetics | VOL. 78

Identification of new schizophrenia susceptibility loci in an ethnically homogeneous, family‐based, Arab‐Israeli sample
Anna Alkelai ... Sara Lupoli
FASEB journal : official publication of the Federation of American Societies for Experimental Biology | VOL. 25
Anna Alkelai, et. al.Anna Alkelai ... Sara Lupoli
27 Jul 2011
FASEB journal : official publication of the Federation of American Societies for Experimental Biology | VOL. 25

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Genomics