A generalized linear mixed model association tool for biobank-scale data.

Longda Jiang, Zhili Zheng, Hailing Fang, Jian Yang

doi:10.1038/s41588-021-00954-4

Abstract

Compared with linear mixed model-based genome-wide association (GWA) methods, generalized linear mixed model (GLMM)-based methods have better statistical properties when applied to binary traits but are computationally much slower. In the present study, leveraging efficient sparse matrix-based algorithms, we developed a GLMM-based GWA tool, fastGWA-GLMM, that is severalfold to orders of magnitude faster than the state-of-the-art tools when applied to the UK Biobank (UKB) data and scalable to cohorts with millions of individuals. We show by simulation that the fastGWA-GLMM test statistics of both common and rare variants are well calibrated under the null, even for traits with extreme case-control ratios. We applied fastGWA-GLMM to the UKB data of 456,348 individuals, 11,842,647 variants and 2,989 binary traits (full summary statistics available at http://fastgwa.info/ukbimpbin), and identified 259 rare variants associated with 75 traits, demonstrating the use of imputed genotype data in a large cohort to discover rare variants for binary complex traits.
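
For readers unfamiliar with the model class, a GLMM for a binary trait is often written as a logistic mixed model like the one sketched below. This is a generic textbook form, not necessarily the exact parameterization used by fastGWA-GLMM. All symbols are assumptions introduced here for illustration: y_i is the case-control status of individual i, x_i a vector of covariates, g_i the genotype of the tested variant, b the vector of random effects, and \Psi a genetic relationship matrix, which we assume is the sparse matrix the abstract alludes to.

```latex
% Generic logistic mixed model (illustrative notation, not taken from the paper).
\begin{aligned}
y_i \mid b_i &\sim \mathrm{Bernoulli}(\mu_i), \\
\operatorname{logit}(\mu_i) &= x_i^{\top}\alpha + g_i\beta + b_i, \\
b &\sim \mathcal{N}\!\left(0,\ \sigma_g^{2}\,\Psi\right), \\
H_0 &: \beta = 0 .
\end{aligned}
```

In this notation, "well calibrated under the null" means that when \beta = 0 the P values of the association test are approximately uniformly distributed, so the false-positive rate matches the nominal level.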

Journal: Nature Genetics
Publication Date: Nov 1, 2021
Citations: 298
License type: CC BY

Similar Papers

Sequence Kernel Association Tests for the Combined Effect of Rare and Common Variants
Iuliana Ionita-Laza ... Xihong Lin
The American Journal of Human Genetics | VOL. 92
16 May 2013

The hidden factor: accounting for covariate effects in power and sample size computation for a binary trait.
Ziang Zhang ... Lei Sun
Bioinformatics | VOL. 39
21 Mar 2023

Extending Rare-Variant Testing Strategies: Analysis of Noncoding Sequence and Imputed Genotypes
Matthew Zawistowski ... Sebastian Zöllner
The American Journal of Human Genetics | VOL. 87
01 Nov 2010

Assessment of a causal relationship between body mass index and atopic dermatitis
Ashley Budu-Aggrey ... Sara J Brown
Journal of Allergy and Clinical Immunology | VOL. 147
17 May 2020
