Functional annotation signatures of disease susceptibility loci improve SNP association analysis.

Edwin S Iversen, Merlise A Clyde, Gary Lipton, Alvaro Na Monteiro

doi:10.1186/1471-2164-15-398

Abstract

Background: Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. Here, we demonstrate how these data can be systematically integrated into an association study’s analysis plan.

Results: We developed a Bayesian statistical model for the prior probability of phenotype–genotype association that incorporates data from past association studies and publicly available functional annotation data regarding the susceptibility variants under study. The model takes the form of a binary regression of association status on a set of annotation variables whose coefficients were estimated through an analysis of associated SNPs in the GWAS Catalog (GC). The functional predictors examined included measures that have been demonstrated to correlate with the association status of SNPs in the GC and some whose utility in this regard is speculative: summaries of the UCSC Human Genome Browser ENCODE super–track data, dbSNP function class, sequence conservation summaries, proximity to genomic variants in the Database of Genomic Variants and known regulatory elements in the Open Regulatory Annotation database, PolyPhen–2 probabilities and RegulomeDB categories. Because we expected that only a fraction of the annotations would contribute to predicting association, we employed a penalized likelihood method to reduce the impact of non–informative predictors and evaluated the model’s ability to predict GC SNPs not used to construct the model. We show that the functional data alone are predictive of a SNP’s presence in the GC. Further, using data from a genome–wide study of ovarian cancer, we demonstrate that their use as prior data when testing for association is practical at the genome–wide scale and improves power to detect associations.

Conclusions: We show how diverse functional annotations can be efficiently combined to create ‘functional signatures’ that predict the a priori odds of a variant’s association to a trait and how these signatures can be integrated into a standard genome–wide–scale association analysis, resulting in improved power to detect truly associated variants.

Electronic supplementary material: The online version of this article (doi:10.1186/1471-2164-15-398) contains supplementary material, which is available to authorized users.
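The workflow described in the abstract (a penalized binary regression of association status on annotations, whose fitted probabilities then serve as prior odds in association testing) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the data are synthetic, the function names are hypothetical, and a ridge (L2) penalty stands in for the unspecified "penalized likelihood method". It is not the authors' implementation.

```python
import numpy as np

def sigmoid(z):
    """Logistic function mapping a linear predictor to a probability."""
    return 1.0 / (1.0 + np.exp(-z))

def fit_penalized_logistic(X, y, l2=1.0, lr=0.1, n_iter=2000):
    """Fit a logistic regression by gradient descent.

    An L2 (ridge) penalty shrinks the annotation coefficients; the
    intercept is left unpenalized. The penalty type is an assumption:
    the source only says 'a penalized likelihood method'.
    """
    n, p = X.shape
    beta0, beta = 0.0, np.zeros(p)
    for _ in range(n_iter):
        resid = sigmoid(beta0 + X @ beta) - y
        beta0 -= lr * resid.mean()
        beta -= lr * (X.T @ resid / n + l2 * beta / n)
    return beta0, beta

def prior_probability(X, beta0, beta):
    """Prior probability of association implied by a SNP's annotations."""
    return sigmoid(beta0 + X @ beta)

def posterior_probability(prior_prob, bayes_factor):
    """Combine prior odds with a Bayes factor from the association data."""
    prior_odds = prior_prob / (1.0 - prior_prob)
    post_odds = prior_odds * bayes_factor
    return post_odds / (1.0 + post_odds)

# Synthetic stand-in for annotation data: 500 SNPs, 5 annotation variables,
# of which only the first and fourth are informative.
rng = np.random.default_rng(0)
n_snps, n_annot = 500, 5
X = rng.normal(size=(n_snps, n_annot))
true_beta = np.array([1.5, 0.0, 0.0, -1.0, 0.0])
y = rng.binomial(1, sigmoid(-1.0 + X @ true_beta))

beta0, beta = fit_penalized_logistic(X, y)
prior = prior_probability(X, beta0, beta)

# Hypothetical Bayes factors; in practice these would come from the
# genotype-phenotype association analysis of each SNP.
bayes_factors = np.full(n_snps, 10.0)
posterior = posterior_probability(prior, bayes_factors)
```

Under this sketch, two SNPs with identical association evidence (the same Bayes factor) receive different posterior probabilities if their annotations differ, which is the sense in which a functional signature can sharpen an association analysis.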

Highlights

Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype.
We show that functional signatures so derived are predictive of the association status of SNPs not used in their creation and that, when coupled with genetic association data following the method we describe, they improve the efficiency of association testing in a genome–wide association study (GWAS) of ovarian cancer.
We focus on the case–control study design to illustrate how the a priori models for functional annotation data described below can be integrated into analyses of genetic association data.

Summary

Introduction

Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. Functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. The prevailing approach to incorporating them is a two–stage hierarchical model in which the coefficients of the stage I generalized linear model for phenotype given genotype and exposure measurements are regressed, in stage II, on the annotation data [3,4,5,6]. This approach is limited to the analysis of a modest number of variants and does not use prior data from previous association studies to inform the nature of that relationship.
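For orientation, a two-stage hierarchical model of the kind described above is often written in the following generic form. The notation is an assumption introduced here for illustration and is not taken from the source: $y_i$ is the phenotype of subject $i$, $g_{ij}$ the genotype at variant $j$, $x_i$ the exposure measurements, and $z_j$ the annotation vector of variant $j$.

```latex
% Stage I: generalized linear model for phenotype given genotype and exposures
\operatorname{logit} \Pr(y_i = 1 \mid g_i, x_i)
  = \alpha + \sum_{j} \beta_j \, g_{ij} + \gamma^{\top} x_i

% Stage II: variant effects regressed on annotation data
\beta_j = z_j^{\top} \pi + \varepsilon_j,
  \qquad \varepsilon_j \sim \mathcal{N}(0, \tau^2)
```

In this form every variant contributes a coefficient $\beta_j$ to a joint stage I model, which is one way to see why the approach is practical only for a modest number of variants.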

Journal: BMC Genomics	Publication Date: Jan 1, 2014
Citations: 60	License type: cc-by
