Abstract

Knowledge of 1p/19q-codeletion and IDH1/2 mutational status is necessary to interpret any investigational study of diffuse gliomas in the modern era. While DNA sequencing is the gold standard for determining IDH mutational status, genome-wide methylation arrays and gene expression profiling have been used for surrogate mutational determination. Previous studies by our group suggest that 1p/19q-codeletion and IDH mutational status can be predicted by genome-wide somatic copy number alteration (SCNA) data alone, however a rigorous model to accomplish this task has yet to be established. In this study, we used SCNA data from 786 adult diffuse gliomas in The Cancer Genome Atlas (TCGA) to develop a two-stage classification system that identifies 1p/19q-codeleted oligodendrogliomas and predicts the IDH mutational status of astrocytic tumors using a machine-learning model. Cross-validated results on TCGA SCNA data showed near perfect classification results. Furthermore, our astrocytic IDH mutation model validated well on four additional datasets (AUC = 0.97, AUC = 0.99, AUC = 0.95, AUC = 0.96) as did our 1p/19q-codeleted oligodendroglioma screen on the two datasets that contained oligodendrogliomas (MCC = 0.97, MCC = 0.97). We then retrained our system using data from these validation sets and applied our system to a cohort of REMBRANDT study subjects for whom SCNA data, but not IDH mutational status, is available. Overall, using genome-wide SCNAs, we successfully developed a system to robustly predict 1p/19q-codeletion and IDH mutational status in diffuse gliomas. This system can assign molecular subtype labels to tumor samples of retrospective diffuse glioma cohorts that lack 1p/19q-codeletion and IDH mutational status, such as the REMBRANDT study, recasting these datasets as validation cohorts for diffuse glioma research.

Highlights

  • Diffuse gliomas comprise the most common adult malignant tumors of the central nervous system [49]

  • TCGA glioma dataset Somatic mutation calls for The Cancer Genome Atlas (TCGA) glioblastomas and lower-grade astrocytic and oligodendroglial tumors (N = 812) computed by the Multi-Center Mutation Calling in Multiple Cancers (MC3) project [20] were downloaded from University of California Santa Cruz (UCSC) Xena [26]

  • TCGA somatic copy number alteration (SCNA) data downloaded from UCSC Xena (UCSC hg19) was the thresholded output of the Genomic Identification of Significant Targets in Cancer 2.0 (GISTIC) algorithm aligned to human genome assembly GRCh37 [46]

Read more

Summary

Introduction

Diffuse gliomas comprise the most common adult malignant tumors of the central nervous system [49]. These adult diffuse gliomas consist of three major. Older retrospective cohorts of diffuse gliomas lacking IDH mutational and 1p/19q-codeletion status have limited utility for validating contemporary adult diffuse glioma study results. Methods have been developed to infer IDH mutation and 1p/19q-codeletion status from methylation array [10, 47], gene expression [12], and magnetic resonance imaging data [3, 35, 37, 43], which can provide surrogate molecular subtype labels for validating adult diffuse glioma study results. SCNA data has the advantage of directly encoding the extent of 1p and 19q loss, an empirical threshold necessary to definitively call 1p/19q-codeletions has yet to be established

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call