Abstract

BackgroundCancer develops when pathways controlling cell survival, cell fate or genome maintenance are disrupted by the somatic alteration of key driver genes. Understanding how pathway disruption is driven by somatic alterations is thus essential for an accurate characterization of cancer biology and identification of therapeutic targets. Unfortunately, current cancer pathway analysis methods fail to fully model the relationship between somatic alterations and pathway activity.ResultsTo address these limitations, we developed a multi-omics method for identifying biologically important pathways and genes in human cancer. Our approach combines single-sample pathway analysis with multi-stage, lasso-penalized regression to find pathways whose gene expression can be explained largely in terms of gene-level somatic alterations in the tumor. Importantly, this method can analyze case-only data sets, does not require information regarding pathway topology and supports personalized pathway analysis using just somatic alteration data for a limited number of cancer-associated genes. The practical effectiveness of this technique is illustrated through an analysis of data from The Cancer Genome Atlas using gene sets from the Molecular Signatures Database.ConclusionsNovel insights into the pathophysiology of human cancer can be obtained from statistical models that predict expression-based pathway activity in terms of non-silent somatic mutations and copy number variation. These models enable the identification of biologically important pathways and genes and support personalized pathway analysis in cases where gene expression data is unavailable.

Highlights

  • Cancer develops when pathways controlling cell survival, cell fate or genome maintenance are disrupted by the somatic alteration of key driver genes

  • We have developed a new multi-omics pathway analysis method for cancer genomic data that aims to: 1 Identify pathways that play an important role in the pathophysiology of human cancers

  • Can the models be used to identify novel driver genes? To determine if the models are effective at identifying novel driver genes, we examined the high-ranking somatic alteration predictors for genes not included in the Catalog of Somatic Mutations in Cancer (COSMIC) cancer gene census

Read more

Summary

Introduction

Cancer develops when pathways controlling cell survival, cell fate or genome maintenance are disrupted by the somatic alteration of key driver genes. High-throughput genomic assays have revolutionized our understanding of cancer Projects such as The Cancer Genome Atlas (TCGA) [1] and the Catalog of Somatic Mutations in Cancer (COSMIC) [2] have collected detailed measurements of DNA sequence, mRNA expression and methylation for thousands of individual tumors across multiple cancer types. The activity of cancer-associated genes can be impacted in a variety of ways including copy number variation, somatic mutations and methylation changes [5] Capturing all of these mechanisms requires the measurement and joint analysis of multiple types of omics data.

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call