Abstract

The treatment of complex and multifactorial diseases constitutes a big challenge in day-to-day clinical practice. As many parameters influence clinical phenotypes, accurate diagnosis and prompt therapeutic management is often difficult. Significant research and investment focuses on state-of-the-art genomic and metagenomic analyses in the burgeoning field of Precision (or Personalized) Medicine with genome-wide-association-studies (GWAS) helping in this direction by linking patient genotypes at specific polymorphic sites (single-nucleotide polymorphisms, SNPs) to the specific phenotype. The generation of polygenic risk scores (PRSs) is a relatively novel statistical method that associates the collective genotypes at many of a person’s SNPs to a trait or disease. As GWAS sample sizes increase, PRSs may become a powerful tool for prevention, early diagnosis and treatment. However, the complexity and multidimensionality of genetic and environmental contributions to phenotypes continue to pose significant challenges for the clinical, broad-scale use of PRSs. To improve the value of PRS measures, we propose a novel pipeline which might better utilize GWAS results and improve the utility of PRS when applied to Alzheimer’s Disease (AD), as a paradigm of multifactorial disease with existing large GWAS datasets that have not yet achieved significant clinical impact. We propose a refined approach for the construction of AD PRS improved by (1), taking into consideration the genetic loci where the SNPs are located, (2) evaluating the post-translational impact of SNPs on coding and non-coding regions by focusing on overlap with open chromatin data and SNPs that are expression quantitative trait loci (QTLs), and (3) scoring and annotating the severity of the associated clinical phenotype into the PRS. Open chromatin and eQTL data need to be carefully selected based on tissue/cell type of origin (e.g., brain, excitatory neurons). Applying such filters to traditional PRS on GWAS studies of complex diseases like AD, can produce a set of SNPs weighted according to our algorithm and a more useful PRS. Our proposed methodology may pave the way for new applications of genomic machine and deep learning pipelines to GWAS datasets in an effort to identify novel clinically useful genetic biomarkers for complex diseases like AD.

Highlights

  • Since the human genome was first sequenced, thousands of genetic variants have been associated with biological functions and diseases [1]

  • Even though controlling for linkage disequilibrium (LD) is implemented in the polygenic risk scores (PRSs) calculation, we propose a functionbased enhancement to address this bias by taking gene loci filtering into consideration

  • We propose that all single-nucleotide polymorphisms (SNPs) that might be included in an Alzheimer’s Disease (AD) PRS

Read more

Summary

Introduction

Since the human genome was first sequenced, thousands of genetic variants have been associated with biological functions and diseases [1]. This is done after selecting variants exceeding a chosen statistical confidence threshold while commonly many such thresholds are tested to identify the one exhibiting optimal performance. While the sample sizes in the aforementioned studies were larger than what is currently available for AD, they provide proof of principle that as AD GWASs become larger and more reliable, PRS is poised to become of high importance in clinical practice, especially when combined with other risk factors [27] It has already been shown in postmortem diagnosed sporadic early-onset AD that the predictive ability of identifying cases and controls is better when using PRS than the APOE locus alone, with a calculated accuracy of. As part of this review, we propose an innovative pipeline to produce a novel set of SNPs that improve the PRS calculation and the explained variance through the following analytical steps

Approach to Calculating an Improved PRS
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call