Abstract
BackgroundGenome-wide association studies have identified hundreds of loci that influence a wide variety of complex human traits; however, little is known regarding the biological mechanism of action of these loci. The recent accumulation of functional genomics (“omics”), including metabolomics data, has created new opportunities for studying the functional role of specific changes in the genome. Functional genomic data are characterized by their high dimensionality, the presence of (strong) statistical dependency between traits, and, potentially, complex genetic control. Therefore, the analysis of such data requires specific statistical genetics methods.ResultsTo facilitate our understanding of the genetic control of omics phenotypes, we propose a trait-centered, network-based conditional genetic association (cGAS) approach for identifying the direct effects of genetic variants on omics-based traits. For each trait of interest, we selected from a biological network a set of other traits to be used as covariates in the cGAS. The network can be reconstructed either from biological pathway databases (a mechanistic approach) or directly from the data, using a Gaussian graphical model applied to the metabolome (a data-driven approach). We derived mathematical expressions that allow comparison of the power of univariate analyses with conditional genetic association analyses. We then tested our approach using data from a population-based Cooperative Health Research in the region of Augsburg (KORA) study (n = 1,784 subjects, 1.7 million single-nucleotide polymorphisms) with measured data for 151 metabolites.ConclusionsWe found that compared to single-trait analysis, performing a genetic association analysis that includes biologically relevant covariates can either gain or lose power, depending on specific pleiotropic scenarios, for which we provide empirical examples. In the context of analyzed metabolomics data, the mechanistic network approach had more power compared to the data-driven approach. Nevertheless, we believe that our analysis shows that neither a prior-knowledge-only approach nor a phenotypic-data-only approach is optimal, and we discuss possibilities for improvement.
Highlights
Genome-wide association studies have identified hundreds of loci that influence a wide variety of complex human traits; little is known regarding the biological mechanism of action of these loci
genome-wide association study (GWAS) have enabled the identification of thousands of loci, significantly increasing our understanding of the genetic basis underlying the control of complex human traits [1]
With respect to the second example, ETFDH (Fig. 2B), we found that the conditional regression of C10 on rs8396 and two covariates (C8 and C12, two medium-chain acylcarnitines) led to a smaller single-nucleotide polymorphism (SNP) partial regression coefficient compared to an unconditional regression; this is because all of the terms in k βi ρgi /ρyg are positive
Summary
Genome-wide association studies have identified hundreds of loci that influence a wide variety of complex human traits; little is known regarding the biological mechanism of action of these loci. The recent accumulation of functional genomics (“omics”), including metabolomics data, has created new opportunities for studying the functional role of specific changes in the genome. Genome-wide association studies (GWASs) are a highly popular method for identifying alleles that affect complex traits in humans, including the risk of common diseases. GWASs have enabled the identification of thousands of loci, significantly increasing our understanding of the genetic basis underlying the control of complex human traits [1]. The recent accumulation of functional genomics (or “omics” for short) data, including information regarding the levels of gene expression (the transcriptome), metabolites (the metabolome), proteins (the proteome), and glycosylation (the glycome), can provide new insight into the functional role of specific changes in the genome [2, 3]. Linking the variants that underlie these variations in metabolomics with various diseases can provide functional insight into the many disease-related associations that were reported in previous studies, including cardiovascular and kidney disease, type 2 diabetes, cancer, gout, venous thromboembolism, and Crohn’s disease [5]
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have