Abstract

Genomewide association studies (GWAS) sometimes identify loci at which both the number and identities of the underlying causal variants are ambiguous. In such cases, statistical methods that model effects of multiple single-nucleotide polymorphisms (SNPs) simultaneously can help disentangle the observed patterns of association and provide information about how those SNPs could be prioritized for follow-up studies. Current multi-SNP methods, however, tend to assume that SNP effects are well captured by additive genetics; yet when genetic dominance is present, this assumption translates to reduced power and faulty prioritizations. We describe a statistical procedure for prioritizing SNPs at GWAS loci that efficiently models both additive and dominance effects. Our method, LLARRMA-dawg, combines a group LASSO procedure for sparse modeling of multiple SNP effects with a resampling procedure based on fractional observation weights. It estimates for each SNP the robustness of association with the phenotype both to sampling variation and to competing explanations from other SNPs. In producing an SNP prioritization that best identifies underlying true signals, we show the following: our method easily outperforms a single-marker analysis; when additive-only signals are present, our joint model for additive and dominance is equivalent to or only slightly less powerful than modeling additive-only effects; and when dominance signals are present, even in combination with substantial additive effects, our joint model is unequivocally more powerful than a model assuming additivity. We also describe how performance can be improved through calibrated randomized penalization, and discuss how dominance in ungenotyped SNPs can be incorporated through either heterozygote dosage or multiple imputation.

Highlights

  • Genomewide association studies (GWAS) have been highly successful at identifying short chromosomal regions harboring causal variants affecting complex disease

  • In this paper we propose a multi-single nucleotide polymorphisms (SNPs) method that uses joint modeling of additive and dominance effects to characterize more accurately the genetic architecture of loci identified in standard GWAS, producing a reprioritization of SNPs that is enriched for true signals relative to single marker and additive-only multi-SNP analyses

  • We start by describing a standard linear model for estimating the additive and dominance effects of m SNPs at a broad genomic locus identified by GWAS of a quantitative outcome in n individuals; we describe a statistical procedure to identify a subset of mq SNPs that might best represent the underlying causal variants using penalized regression combined with a resampling procedure

Read more

Summary

Introduction

Genomewide association studies (GWAS) have been highly successful at identifying short chromosomal regions (loci) harboring causal variants affecting complex disease. The pattern of association and LD structure is so complex that both the number and identities of such best representative SNPs are ambiguous [e.g., Strange et al 2010]; this ambiguity is problematic because it provides a poorly informed starting point for subsequent experimental or annotation-based follow-up At these more complex loci, there is a compelling case for reanalysis using procedures that model multiple SNP effects simultaneously. In theory, such multi-SNP methods should lead to more robust estimates of each SNP’s effect and should better distinguish which SNPs represent independent signals worthy of subsequent prioritization [Stephens and Balding 2009; Wang et al 2012]. This may be in part because such methods, in balancing a more complicated set of statistical priorities, make trade offs between computational convenience and biological comprehensiveness that can seem severe, arbitrary, and/or omitting of important features

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call