Abstract
BackgroundGenetic profiling of cancers for variations in copy number, structure or expression of certain genes has improved diagnosis, risk-stratification and therapeutic decision-making. However the tumor-restricted nature of these changes limits their application to certain cancer types or sub-types. Tests with broader prognostic capabilities are lacking.MethodsUsing RNAseq data from 10,227 tumors in The Cancer Genome Atlas (TCGA), we evaluated 212 protein-coding transcripts from 12 cancer-related pathways. We employed t-distributed stochastic neighbor embedding (t-SNE) to identify expression pattern difference among each pathway’s transcripts. We have previously used t-SNE to show that survival in some cancers correlates with expression patterns of transcripts encoding ribosomal proteins and enzymes for cholesterol biosynthesis and fatty acid oxidation.ResultsUsing the above 212 transcripts, t-SNE-assisted transcript pattern profiling identified patient cohorts with significant survival differences in 30 of 34 different cancer types comprising 9350 tumors (91.4% of all TCGA cases). Small subsets of each pathway’s transcripts, comprising no more than 50–60 from the original group, played particularly prominent roles in determining overall t-SNE patterns. In several cases, further refinements in long-term survival could be achieved by sequential t-SNE profiling with two pathways’ transcripts, by a combination of t-SNE plus whole transcriptome profiling or by employing t-SNE on immuno-histochemically defined breast cancer subtypes. In two cancer types, individuals with Stage IV disease at presentation could be readily subdivided into groups with highly significant survival differences based on t-SNE-based tumor sub-classification.Conclusionst-SNE-assisted profiling of a small number of transcripts allows the prediction of long-term survival across multiple cancer types.
Highlights
Genetic profiling of cancers for variations in copy number, structure or expression of certain genes has improved diagnosis, risk-stratification and therapeutic decision-making
Using the machine learning algorithm “t-distributed stochastic neighbor embedding” (t-SNE) [6] we have previously demonstrated that the expression patterns of ribosomal protein transcripts (RPTs) differ among normal tissues and cancers in distinct and reproducible ways that are largely independent of absolute expression levels [7, 8]
The t-SNE clusters of individual pathways correlated with survival in 3–14 cancer types comprising 9.6–38.9% of the entire The Cancer Genome Atlas (TCGA) population (Figs. 2 and 3 and Additional file 1: Figures S13–14)
Summary
Genetic profiling of cancers for variations in copy number, structure or expression of certain genes has improved diagnosis, risk-stratification and therapeutic decision-making. Next-generation DNA and RNA sequencing, have identified gene copy number variations, recurrent mutations, rearrangements and transcript expression differences in many cancers. These may be associated with specific tumor subtypes, biological behaviors, therapeutic responses and outcomes not otherwise revealed by more traditional histologic or immuno-histochemical assessments [1,2,3,4,5]. There is a clear need to assess these parameters more globally and across multiple cancers with a common and preferably small set of genes The availability of such a test could greatly simplify and expand the molecular evaluation of cancers, further improve prognostication and therapeutic stratification and aid in decisions regarding the frequency and intensity of post-therapy follow-up. We made similar observations with transcripts encoding enzymes involved in cholesterol biosynthesis and fatty acid oxidation (FAO) [9]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.