Penalized Logistic Regression Analysis for Genetic Association Studies of Binary Phenotypes

Ying Yu,Angela Brooks-Wilson,Siyuan Chen,Brad Mcneney,Rawnak Hoque,Olga Vishnyakova,Samantha Jean Jones

doi:10.1159/000525650

Abstract

Introduction: Increasingly, logistic regression methods for genetic association studies of binary phenotypes must be able to accommodate data sparsity, which arises from unbalanced case-control ratios and/or rare genetic variants. Sparseness leads to maximum likelihood estimators (MLEs) of log-OR parameters that are biased away from their null value of zero and tests with inflated type I errors. Different penalized likelihood methods have been developed to mitigate sparse data bias. We study penalized logistic regression using a class of log-F priors indexed by a shrinkage parameter m to shrink the biased MLE toward zero. Methods: We proposed a two-step approach to the analysis of a genetic association study: first, a set of variants that show evidence of association with the trait is used to estimate m; second, the estimated m is used for log-F-penalized logistic regression analyses of all variants using data augmentation with standard software. Our estimate of m is the maximizer of a marginal likelihood obtained by integrating the latent log-ORs out of the joint distribution of the parameters and observed data. We consider two approximate approaches to maximizing the marginal likelihood: (i) a Monte Carlo EM algorithm and (ii) a Laplace approximation to each integral, followed by derivative-free optimization of the approximation. Results: We evaluated the statistical properties of our proposed two-step method and compared its performance to other shrinkage methods by a simulation study. Our simulation studies suggest that the proposed log-F-penalized approach has lower bias and mean squared error than other methods considered. We also illustrated the approach on data from a study of genetic associations with “Super Senior” cases and middle-aged controls. Discussion/Conclusion: We have proposed a method for single rare variant analysis with binary phenotypes by logistic regression penalized by log-F priors. Our method has the advantage of being easily extended to correct for confounding due to population structure and genetic relatedness through a data augmentation approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Penalized Logistic Regression Analysis for Genetic Association Studies of Binary Phenotypes

Abstract

Talk to us

Similar Papers

More From: Human Heredity

Lead the way for us

Journal: Human Heredity	Publication Date: Jan 1, 2022
License type: CC BY-NC 4.0

Similar Papers

Sequence Kernel Association Tests for the Combined Effect of Rare and Common Variants
Iuliana Ionita-Laza ... Xihong Lin
The American Journal of Human Genetics | VOL. 92
Iuliana Ionita-Laza, et. al.Iuliana Ionita-Laza ... Xihong Lin
16 May 2013
The American Journal of Human Genetics | VOL. 92

Genomic Diversity Evaluation of Populus trichocarpa Germplasm for Rare Variant Genetic Association Studies.
Anthony Piot ... Julien Prunier
Frontiers in Genetics | VOL. 10
Anthony Piot, et. al.Anthony Piot ... Julien Prunier
28 Jan 2020
Frontiers in Genetics | VOL. 10

EEG data augmentation: towards class imbalance problem in sleep staging tasks
Jiahao Fan ... Xinyu Jiang
Journal of Neural Engineering | VOL. 17
Jiahao Fan, et. al.Jiahao Fan ... Xinyu Jiang
01 Oct 2020
Journal of Neural Engineering | VOL. 17

Integrating rare genetic variants into pharmacogenetic drug response predictions
Magnus Ingelman-Sundberg ... Yitian Zhou
Human Genomics | VOL. 12
Magnus Ingelman-Sundberg, et. al.Magnus Ingelman-Sundberg ... Yitian Zhou
25 May 2018
Human Genomics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Penalized Logistic Regression Analysis for Genetic Association Studies of Binary Phenotypes

Abstract

Talk to us

Similar Papers

More From: Human Heredity