Abstract

Molecular quantitative trait locus (QTL) analyses are increasingly popular to explore the genetic architecture of complex traits, but existing studies do not leverage shared regulatory patterns and suffer from a large multiplicity burden, which hampers the detection of weak signals such as trans associations. Here, we present a fully multivariate proteomic QTL (pQTL) analysis performed with our recently proposed Bayesian method LOCUS on data from two clinical cohorts, with plasma protein levels quantified by mass-spectrometry and aptamer-based assays. Our two-stage study identifies 136 pQTL associations in the first cohort, of which >80% replicate in the second independent cohort and have significant enrichment with functional genomic elements and disease risk loci. Moreover, 78% of the pQTLs whose protein abundance was quantified by both proteomic techniques are confirmed across assays. Our thorough comparisons with standard univariate QTL mapping on (1) these data and (2) synthetic data emulating the real data show how LOCUS borrows strength across correlated protein levels and markers on a genome-wide scale to effectively increase statistical power. Notably, 15% of the pQTLs uncovered by LOCUS would be missed by the univariate approach, including several trans and pleiotropic hits with successful independent validation. Finally, the analysis of extensive clinical data from the two cohorts indicates that the genetically-driven proteins identified by LOCUS are enriched in associations with low-grade inflammation, insulin resistance and dyslipidemia and might therefore act as endophenotypes for metabolic diseases. While considerations on the clinical role of the pQTLs are beyond the scope of our work, these findings generate useful hypotheses to be explored in future research; all results are accessible online from our searchable database. Thanks to its efficient variational Bayes implementation, LOCUS can analyze jointly thousands of traits and millions of markers. Its applicability goes beyond pQTL studies, opening new perspectives for large-scale genome-wide association and QTL analyses.Diet, Obesity and Genes (DiOGenes) trial registration number: NCT00390637.

Highlights

  • Questioning the genetic contribution to human diseases has become a critical step towards predicting health risks and developing effective therapies [1,2,3]

  • We illustrate and exploit the advantages of our recently introduced Bayesian framework LOCUS in a fully multivariate proteomic QTL (pQTL) study, with 300K tag SNPs and 100 − 1, 000 plasma protein levels measured by two distinct technologies

  • LOCUS identifies novel pQTLs that replicate in an independent cohort, confirms signals documented in studies 2 − 18 times larger, and detects more pQTLs than a conventional two-stage univariate analysis of our datasets

Read more

Summary

Introduction

Questioning the genetic contribution to human diseases has become a critical step towards predicting health risks and developing effective therapies [1,2,3]. While eQTL studies are routinely performed, pQTL studies have emerged only recently [4,5,6,7,8,9]. These studies allow the exploration of the genetic bases of several diseases, as certain proteins may act as proxies for specific clinical endpoints [10]. The clinical data complementing QTL data are often very limited, restricting subsequent investigation to external information from unrelated populations, health status or study designs, and rendering some degree of speculation unavoidable

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call