False discovery rate control in genome-wide association studies with population structure

Matteo Sesia,Stephen Bates,Emmanuel Candès,Jonathan Marchini,Chiara Sabatti

doi:10.1073/pnas.2105841118

Abstract

We present a comprehensive statistical framework to analyze data from genome-wide association studies of polygenic traits, producing interpretable findings while controlling the false discovery rate. In contrast with standard approaches, our method can leverage sophisticated multivariate algorithms but makes no parametric assumptions about the unknown relation between genotypes and phenotype. Instead, we recognize that genotypes can be considered as a random sample from an appropriate model, encapsulating our knowledge of genetic inheritance and human populations. This allows the generation of imperfect copies (knockoffs) of these variables that serve as ideal negative controls, correcting for linkage disequilibrium and accounting for unknown population structure, which may be due to diverse ancestries or familial relatedness. The validity and effectiveness of our method are demonstrated by extensive simulations and by applications to the UK Biobank data. These analyses confirm our method is powerful relative to state-of-the-art alternatives, while comparisons with other studies validate most of our discoveries. Finally, fast software is made available for researchers to analyze Biobank-scale datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the National Academy of Sciences of the United States of America	Publication Date: Sep 27, 2021
Citations: 46	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

False discovery rate control in genome-wide association studies with population structure

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America

Lead the way for us

Similar Papers

Stratified false discovery control for large‐scale hypothesis testing with application to genome‐wide association studies
Lei Sun ... Shelley B Bull
Genetic Epidemiology | VOL. 30
Lei Sun, et. al.Lei Sun ... Shelley B Bull
23 Jun 2006
Genetic Epidemiology | VOL. 30

Penalized multimarker vs. single-marker regression methods for genome-wide association studies of quantitative traits.
Hui Yi ... Yongmei Liu
Genetics | VOL. 199
Hui Yi, et. al.Hui Yi ... Yongmei Liu
28 Oct 2014
Genetics | VOL. 199

Evaluating FDR and stratified FDR control approaches for high-throughput biological studies
Jinfeng Zou ... Guini Hong
-
Jinfeng Zou, et. al.Jinfeng Zou ... Guini Hong
01 Jun 2012
01 Jun 2012

Controlling the False Discovery Rate with Constraints: The Newman‐Keuls Test Revisited
Juliet Popper Shaffer
Biometrical Journal | VOL. 49
Juliet Popper ShafferJuliet Popper Shaffer
31 Jan 2007
Biometrical Journal | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

False discovery rate control in genome-wide association studies with population structure

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America