Abstract

The emergence of large-scale multi-omics data warrants method development for data integration. Genomic studies from cancer patients have identified epigenetic and genetic regulators – such as methylation marks, somatic mutations, and somatic copy number alterations (SCNAs), among others – as predictive features of cancer outcome. However, identification of “driver genes” associated with a given alteration remains a challenge. To this end, we developed a computational tool, iEDGE, to model cis and trans effects of (epi-)DNA alterations and identify potential cis driver genes, where cis and trans genes denote those genes falling within and outside the genomic boundaries of a given (epi-)genetic alteration, respectively. iEDGE first identifies the cis and trans gene expression signatures associated with the presence/absence of a particular epi-DNA alteration across samples. It then applies tests of statistical mediation to determine the cis genes predictive of the trans gene expression. Finally, cis and trans effects are annotated by pathway enrichment analysis to gain insights into the underlying regulatory networks. We used iEDGE to perform integrative analysis of SCNAs and gene expression data from breast cancer and 18 additional cancer types included in The Cancer Genome Atlas (TCGA). Notably, cis gene drivers identified by iEDGE were found to be significantly enriched for known driver genes from multiple compendia of validated oncogenes and tumor suppressors, suggesting that the remainder are of equal importance. Furthermore, predicted drivers were enriched for functionally relevant cancer genes with amplification-driven dependencies, which are of potential prognostic and therapeutic value. All the analyses results are accessible at https://montilab.bu.edu/iEDGE. In summary, integrative analysis of SCNAs and gene expression using iEDGE successfully identified known cancer driver genes and putative cancer therapeutic targets across 19 cancer types in the TCGA. The proposed method can easily be applied to the integration of gene expression profiles with other epi-DNA assays in a variety of disease contexts.

Highlights

  • A central goal of cancer genomics is to identify key genetic and epigenetic alterations that promote initiation and/ or progression of cancer

  • We developed a methodology that identifies somatic copy number alterations (SCNAs)- associated genes and performs prediction of cis gene drivers prioritized by their capability of transactivating downstream gene expression, by leveraging the trans-gene signature whose expression is associated with a given SCNA event of interest

  • Based on the Sobel test of mediation[8], we identify putative SCNA-associated cancer drivers as the cis gene mediating the most trans gene expression, the method allows users to customize the set of trans genes considered for the ranking of putative cis gene drivers

Read more

Summary

Introduction

A central goal of cancer genomics is to identify key genetic and epigenetic alterations that promote initiation and/ or progression of cancer. Large-scale cancer genomics data compendia such as The Cancer Genome Atlas (TCGA) have generated comprehensive multi-omics datasets for tens of thousands of patients across ~30 types of cancer[1] The availability of such large-scale datasets provides an opportunity to develop methods that integrate data from multiple types of profiling platforms, supporting the discovery of novel candidate driver genes and eventually diagnostic and prognostic biomarkers and therapeutic targets. The approach is predicated on the hypothesis that SCNA-related drivers of tumorigenesis will mediate a larger proportion of the downstream effect observable by trans gene expression than non-drivers Using this heuristic, and based on the Sobel test of mediation[8], we identify putative SCNA-associated cancer drivers as the cis gene mediating the most trans gene expression, the method allows users to customize the set of trans genes considered for the ranking of putative cis gene drivers

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call