Abstract Background: Intra-tumor heterogeneity (ITH) plays a pivotal role in driving breast cancer progression and therapeutic resistance. Emerging evidence has indicated that the extent of genetic heterogeneity may serve as a clinically useful biomarker. While several studies have suggested the prognostic value of ITH in several cancer types, the clinical significance of genetic ITH and molecular portraits that correlated with different ITH levels were poorly understood in breast cancer. The establishment of algorithms estimating genetic ITH based on sequencing of bulk tumor DNA offered us an opportunity to explore the clinical implication of ITH in large breast cancer cohorts and, for the first time, to use integrative genomic analyses to reveal molecular portraits related to intra-tumor genetic heterogeneity. Methods: We assessed 916 female breast cancer patients from The Cancer Genome Atlas. Mutant-allele tumor heterogeneity (MATH) values were calculated from whole-exome sequencing data. We used integers nearest to the tertiles of the MATH values as cutoff points to divide the patients into three groups nearly equal in size. The association between MATH value and clinical characteristics was evaluated, followed by survival analyses in these different MATH groups. We then compared the rates of total non-silent somatic mutations among the different MATH groups, and further determined the mutations independently associated with high MATH by logistic regression adjusting for T classification and clinical subtypes. Similar methods, superadding somatic copy number alteration (SCNA) burden in logistic model, were used to evaluate SCNA events that were significantly associated with high MATH level. Gene enrichment between the high and rest MATH groups was analyzed using Gene Set Enrichment Analysis. Results: The patients were divided into low (MATH value lower than 33), intermediate (MATH between 33 and 46) and high (MATH higher than 46) MATH groups. High T stage, African American race, and triple-negative or basal-like subtype were associated with a higher MATH level (all P<0.001). In hormone receptor-positive and human epidermal growth factor receptor-negative patients, the high MATH group showed a tendency toward a worse overall survival (P=0.052); however, while in triple-negative breast cancer, both high and low MATH indicated a worse outcome (P=0.032). Furthermore, the TP53 mutation rate increased as MATH was elevated (P<0.001), whereas CDH1 mutations were correlated with a lower level of MATH (P=0.002). Several focal and arm-level SCNA events were more common in the high MATH group, including Chr8q24 with only the MYC gene in the “peak” region. Similarly, high MATH was associated with gene set enrichment related to the MYC pathway and proliferation. Conclusion: Our study extended the knowledge concerning the clinical role of ITH in breast cancer, especially the distinct pattern of prognostic values in different clinical subtypes, which may help promote the clinical utilization of genetic ITH. Our attempt at exploring the molecular features related to ITH might provide clues for the source and consequences of ITH, inspiring subsequent experiments investigating the laws underling tumor heterogeneity. Citation Format: Ma D, Jiang Y-Z, Liu X-Y, Liu Y-R, Yu K-D, Shao Z-M. Clinical and molecular relevance of intra-tumor genetic heterogeneity in breast cancer: Integrative analysis of data from The Cancer Genome Atlas [abstract]. In: Proceedings of the 2016 San Antonio Breast Cancer Symposium; 2016 Dec 6-10; San Antonio, TX. Philadelphia (PA): AACR; Cancer Res 2017;77(4 Suppl):Abstract nr P1-06-07.
Read full abstract