BackgroundIntratumor heterogeneity (ITH) plays a crucial role in tumor progression, relapse, immune evasion, and drug resistance. Existing ITH quantification methods based on a single molecular level are inadequate to capture ITH evolving from genotype to phenotype. MethodsWe designed a set of information entropy (IE)-based algorithms for quantifying ITH at the genome (somatic copy number alterations and mutations), mRNA, microRNA (miRNA), long non-coding RNA (lncRNA), protein, and epigenome level, respectively. We evaluated the performance of these algorithms by analyzing the correlations between their ITH scores and ITH-associated molecular and clinical features in 33 TCGA cancer types. Moreover, we evaluated the correlations between the ITH measures at different molecular levels by Spearman correlation and clustering analysis. ResultsThe IE-based ITH measures had significant correlations with unfavorable prognosis, tumor progression, genomic instability, antitumor immunosuppression, and drug resistance. The mRNA ITH showed stronger correlations with the miRNA, lncRNA, and epigenome ITH than with the genome ITH, supporting the regulatory relationships of miRNA, lncRNA, and DNA methylation towards mRNA. The protein-level ITH displayed stronger correlations with the transcriptome-level ITH than with the genome-level ITH, supporting the central dogma of molecular biology. Clustering analysis based on the ITH scores identified four subtypes of pan-cancer showing significantly different prognosis. Finally, the ITH integrating the seven ITH measures displayed more prominent properties of ITH than that at a single level. ConclusionsThis analysis provides landscapes of ITH at various molecular levels. Combining the ITH observation from different molecule levels will improve personalized management for cancer patients.
Read full abstract