Radiomics has been widely recognized for its effectiveness in decoding tumor phenotypes through the extraction of quantitative imaging features. However, the robustness of radiomic methods to estimate clinically relevant biomarkers non-invasively remains largely untested. In this study, we propose Cascaded Data Processing Network (CDPNet), a radiomic feature learning method to predict tumor molecular status from medical images. We apply CDPNet to an epigenetic case, specifically targeting the estimation of O6-methylguanine-DNA-methyltransferase (MGMT) promoter methylation from Magnetic Resonance Imaging (MRI) scans of glioblastoma patients. CDPNet has three components: 1) Principal Component Analysis (PCA), 2) Fisher Linear Discriminant (FLD), and 3) a combination of hashing and blockwise histograms. The outlined architectural framework capitalizes on PCA to reconstruct input image patches, followed by FLD to extract discriminative filter banks, and finally using binary hashing and blockwise histogram module for indexing, pooling, and feature generation. To validate the effectiveness of CDPNet, we conducted an exhaustive evaluation on a comprehensive retrospective cohort comprising 484 IDH-wildtype glioblastoma patients with pre-operative multi-parametric MRI scans (T1, T1-Gd, T2, and T2-FLAIR). The prediction of MGMT promoter methylation status was cast as a binary classification problem. The developed model underwent rigorous training via 10-fold cross-validation on a discovery cohort of 446 patients. Subsequently, the model's performance was evaluated on a distinct and previously unseen replication cohort of 38 patients. Our method achieved an accuracy of 70.11% and an area under the curve of 0.71 (95% CI: 0.65 - 0.74).
Read full abstract