Abstract Background: Advancements in multi-omics data integration and explainable Machine Learning (ML) have shown promise in precision oncology. Multi-omic data used to train ML models may include genomics, transcriptomics and histopathology to characterize cancer cells and the tumor microenvironment (TME). Explainability methods, such as SHAP, have enabled researchers and clinicians to unravel the decision-making rationale of ML models predicting cancer progression and treatment response. We developed an explainable ML framework that incorporates multi-omic features of cancer and the TME. This framework was applied to predict patient response to neoadjuvant chemotherapy (NAC) in breast cancer and immune checkpoint inhibitor (ICI) in melanoma. Methods: For breast cancer, we used the cohort from Sammut et al. [1] (n=157 training, n=75 test). For melanoma, we assembled a cohort comprising 229 patients (n=138 training, n=53 test cutaneous, n=38 test non-cutaneous) from five independent studies. We improved the performance of the ensemble ML models in Sammut et al. [1] by implementing a shared-learning architecture to enable component models to influence each other as training progresses. We applied this ensemble (Ens:LR+RF+SVM) to predict NAC response in breast cancer and ICI response in melanoma by integrating clinical, DNA sequencing, RNA sequencing and histopathology (only for breast cancer) data. For melanoma, we also trained three single ML models (LR, RF, and SVM) and another ensemble (Ens:LR+RF), and introduced a novel dual utility of SHAP for feature-selection during training and biomarker threshold identification during validation. Results: The ensemble model trained on the multi-omic breast cancer features achieved ROC-AUC of 0.88 and showed a potential 25% reduction in false positives (i.e., incorrect predictions of good response) compared to its predecessor from Sammut et al. [1]. In the melanoma cohort, the Ens:LR+RF model achieved ROC-AOC of 0.77 but was outperformed by the RF model, ROC-AUC 0.78. SHAP revealed unique interactions between each ML model and the feature space, resulting in distinct training feature sets per model. During validation, the intersection between feature values and SHAP scores revealed numerical thresholds underpinning good versus poor responses of clinically meaningful biomarkers such as neoantigen load (>2.25 good, <2.25 poor, values in log10 scale). Across these two studies, we developed and open-sourced a scalable and versatile ML workflow (xML-workFLow) for rapid experimentation in biomedical research. Conclusions: This work showcases the potential of multi-omics explainable ML in advancing precision oncology to improve treatment outcome prediction. With further experimental validation, the use of explainable ML to determine numerical thresholds could guide the development of companion diagnostics and inform combination therapeutic strategies.
Read full abstract- All Solutions
Editage
One platform for all researcher needs
Paperpal
AI-powered academic writing assistant
R Discovery
Your #1 AI companion for literature search
Mind the Graph
AI tool for graphics, illustrations, and artwork
Journal finder
AI-powered journal recommender
Unlock unlimited use of all AI tools with the Editage Plus membership.
Explore Editage Plus - Support
Overview
1033 Articles
Published in last 50 years
Related Topics
Articles published on Neoadjuvant Chemotherapy Response
Authors
Select Authors
Journals
Select Journals
Duration
Select Duration
972 Search results
Sort by Recency