POLARIS: Polygenic LD-adjusted risk score approach for set-based analysis of GWAS data.

Emily Baker,Karl Michael Schmidt,Peter Holmans,Rebecca Sims,Julie Williams,Valentina Escott-Price,With The Gerad Consortium,Michael C O'Donovan

doi:10.1002/gepi.22117

Emily Baker, Karl Michael Schmidt + Show 6 more

Open Access

PDF Available

https://doi.org/10.1002/gepi.22117

Copy DOI

Export

Save

Cite

Journal: Genetic Epidemiology	Publication Date: Mar 12, 2018
Citations: 25	License type: CC BY 4.0

Affiliation: Medical Research Council, Cardiff University

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

ABSTRACTPolygenic risk scores (PRSs) are a method to summarize the additive trait variance captured by a set of SNPs, and can increase the power of set‐based analyses by leveraging public genome‐wide association study (GWAS) datasets. PRS aims to assess the genetic liability to some phenotype on the basis of polygenic risk for the same or different phenotype estimated from independent data. We propose the application of PRSs as a set‐based method with an additional component of adjustment for linkage disequilibrium (LD), with potential extension of the PRS approach to analyze biologically meaningful SNP sets. We call this method POLARIS: POlygenic Ld‐Adjusted RIsk Score. POLARIS identifies the LD structure of SNPs using spectral decomposition of the SNP correlation matrix and replaces the individuals' SNP allele counts with LD‐adjusted dosages. Using a raw genotype dataset together with SNP effect sizes from a second independent dataset, POLARIS can be used for set‐based analysis. MAGMA is an alternative set‐based approach employing principal component analysis to account for LD between markers in a raw genotype dataset. We used simulations, both with simple constructed and real LD‐structure, to compare the power of these methods. POLARIS shows more power than MAGMA applied to the raw genotype dataset only, but less or comparable power to combined analysis of both datasets. POLARIS has the advantages that it produces a risk score per person per set using all available SNPs, and aims to increase power by leveraging the effect sizes from the discovery set in a self‐contained test of association in the test dataset.

Highlights

Polygenic risk scores (PRSs) are widely used for a variety of purposes in assessing the genetic liability to disorders or more general phenotypes
We present a method for accounting for linkage disequilibrium (LD) in the calculation of a PRS
The resulting individual LDadjusted PRS can be used for analyzing whether a set of single nucleotide polymorphisms (SNPs) is associated with disease

Summary

Introduction

Polygenic risk scores (PRSs) are widely used for a variety of purposes in assessing the genetic liability to disorders or more general phenotypes. These include sample stratification, risk prediction, and the detection of relationships between different subphenotypes (see, e.g., Allardyce et al, 2017; Escott-Price et al, 2015, and Foley et al, 2017, respectively). The PRS method can be adapted to partition the polygenic risk based on meaningful SNP sets, such as genes or biological pathways, and to determine whether a set of SNPs, weighted with their individual genetic risk effects, is associated at the whole-genome or set-specific levels.

Objectives

Methods

Results

Conclusion