Mining gold dust under the genome wide significance level: a two‐stage approach to analysis of GWAS

Gang Shi,Eric Boerwinkle,C Charles Gu,D.C Rao,Alanna C Morrison,Aravinda Chakravarti

doi:10.1002/gepi.20556

Abstract

We propose a two-stage approach to analyze genome-wide association data in order to identify a set of promising single-nucleotide polymorphisms (SNPs). In stage one, we select a list of top signals from single SNP analyses by controlling false discovery rate. In stage two, we use the least absolute shrinkage and selection operator (LASSO) regression to reduce false positives. The proposed approach was evaluated using simulated quantitative traits based on genome-wide SNP data on 8,861 Caucasian individuals from the Atherosclerosis Risk in Communities (ARIC) Study. Our first stage, targeted at controlling false negatives, yields better power than using Bonferroni-corrected significance level. The LASSO regression reduces the number of significant SNPs in stage two: it reduces false-positive SNPs and it reduces true-positive SNPs also at simulated causal loci due to linkage disequilibrium. Interestingly, the LASSO regression preserves the power from stage one, i.e., the number of causal loci detected from the LASSO regression in stage two is almost the same as in stage one, while reducing false positives further. Real data on systolic blood pressure in the ARIC study was analyzed using our two-stage approach which identified two significant SNPs, one of which was reported to be genome-significant in a meta-analysis containing a much larger sample size. On the other hand, a single SNP association scan did not yield any significant results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Mining gold dust under the genome wide significance level: a two‐stage approach to analysis of GWAS

Abstract

Talk to us

Similar Papers

More From: Genetic Epidemiology

Lead the way for us

Journal: Genetic Epidemiology	Publication Date: Dec 31, 2010
Citations: 49

Similar Papers

Relation of Adult-Onset Asthma to Coronary Heart Disease and Stroke
Stephen J Onufrak ... L Viola Vaccarino
The American Journal of Cardiology | VOL. 101
Stephen J Onufrak, et. al.Stephen J Onufrak ... L Viola Vaccarino
05 Mar 2008
The American Journal of Cardiology | VOL. 101

Usefulness of Ventricular Premature Complexes to Predict Coronary Heart Disease Events and Mortality (from the Atherosclerosis Risk In Communities Cohort)
Mark W Massing ... Gerardo Heiss
The American Journal of Cardiology | VOL. 98
Mark W Massing, et. al.Mark W Massing ... Gerardo Heiss
18 Oct 2006
The American Journal of Cardiology | VOL. 98

Abstract 1028: Association of plasma C-reactive protein (CRP) and CRP genetic risk score with cancer risk in the Atherosclerosis Risk in Communities (ARIC) study
Anna E Prizment ... Kala Visvanathan
Cancer Research | VOL. 72
Anna E Prizment, et. al.Anna E Prizment ... Kala Visvanathan
15 Apr 2012
Cancer Research | VOL. 72

Orthostatic Hypotension and Cardiovascular Risk
Cyndya Shibao ... Italo Biaggioni
Hypertension | VOL. 56
Cyndya Shibao, et. al.Cyndya Shibao ... Italo Biaggioni
08 Nov 2010
Hypertension | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mining gold dust under the genome wide significance level: a two‐stage approach to analysis of GWAS

Abstract

Talk to us

Similar Papers

More From: Genetic Epidemiology