Abstract

Altered cellular metabolism is an important characteristic and driver of cancer. Surprisingly, however, we find here that aggregating individual gene expression using canonical metabolic pathways fails to enhance the classification of noncancerous vs. cancerous tissues and the prediction of cancer patient survival. This supports the notion that metabolic alterations in cancer rewire cellular metabolism through unconventional pathways. Here we present MCF (Metabolic classifier and feature generator), which incorporates gene expression measurements into a human metabolic network to infer new cancer-mediated pathway compositions that enhance cancer vs. adjacent noncancerous tissue classification across five different cancer types. MCF outperforms standard classifiers based on individual gene expression and on canonical human curated metabolic pathways. It successfully builds robust classifiers integrating different datasets of the same cancer type. Reassuringly, the MCF pathways identified lead to metabolites known to be associated with the pertaining specific cancer types. Aggregating gene expression through MCF pathways leads to markedly better predictions of breast cancer patients’ survival in an independent cohort than using the canonical human metabolic pathways (C-index = 0.69 vs. 0.52, respectively). Notably, the survival predictive power of individual MCF pathways strongly correlates with their power in predicting cancer vs. noncancerous samples. The more predictive composite pathways identified via MCF are hence more likely to capture key metabolic alterations occurring in cancer than the canonical pathways characterizing healthy human metabolism.

Highlights

  • In recent years the study of cancer metabolism gained renewed interest as means to understand cancer’s emergence, pathophysiology, and for finding candidate targets for therapeutics [1,2,3,4,5,6]

  • We observe that aggregating individual gene expression using canonical human metabolic pathways frequently fails to enhance the classification

  • We overlaid gene expression data derived from 3611 samples across ten datasets of five cancer types onto canonical metabolic pathways defined in the RECON1 human metabolic model [20] and quantified the expression of every metabolic pathway based on the sum of the expression of all genes associated with this pathway (Methods, which in this case yields better performance than using the mean expression)

Read more

Summary

Author Summary

Cancer proliferating cells adapt their metabolism to support the conversion of available nutrients into biomass, which often involves an increased rate of specific metabolic pathways, such as glycolysis. We observe that aggregating individual gene expression using canonical human metabolic pathways frequently fails to enhance the classification. Of noncancerous vs cancerous tissues and in the task of predicting cancer patient survival. This supports the notion that metabolic alterations in cancer rewire cellular metabolism through unconventional pathways. We introduce a novel algorithm (MCF) that aims to identify these cancer-mediated ‘composite’ metabolic pathways by identifying those that best differentiate between cancerous vs non-cancerous tissues gene expression. MCF successfully builds robust classifiers integrating different datasets of the same cancer type. We further show that the data-driven pathways identified by MCF, in contrast to the canonical literature-based pathways, successfully generate clinically relevant features that are predictive of breast cancer patients’ survival in an independent dataset. Our findings suggest that cancer metabolism may be rewired via non-standard composite pathways

Introduction
Results
Identify seed reporter metabolites
Building an SVM classifier
Discussion
Materials and Methods
Evaluation of classifiers
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call