Accurate prognostication of overall survival (OS) for non-small cell lung cancer (NSCLC) patients receiving definitive radiotherapy (RT) is crucial for developing personalized treatment strategies. This study aims to construct an interpretable prognostic model that combines radiomic features extracted from normal lung and from primary tumor with clinical parameters. Our model aimed to clarify the complex, nonlinear interactions between these variables and enhance prognostic accuracy. We included 661 stage III NSCLC patients from three multi-national datasets: a training set (N = 349), test-set-1 (N = 229), and test-set-2 (N = 83), all undergoing definitive RT. A total of 104 distinct radiomic features were separately extracted from the regions of interest in the lung and the tumor. We developed four predictive models using eXtreme gradient boosting and selected the top 10 features based on the Shapley additive explanations (SHAP) values. These models were the tumor radiomic model (Model-T), lung radiomic model (Model-L), a combined radiomic model (Model-LT), and an integrated model incorporating clinical parameters (Model-LTC). Model performance was evaluated through Harrell's concordance index, Kaplan-Meier survival curves, time-dependent area under the receiver operating characteristic curve (AUC), calibration curves, and decision curve analysis. Interpretability was assessed using the SHAP framework. Model-LTC exhibited superior performance, with notable predictive accuracy (C-index: training set, 0.87; test-set-2, 0.76) and time-dependent AUC above 0.75. Complex nonlinear relationships and interactions were evident among the model's variables. The integration of radiomic and clinical factors within an interpretable framework significantly improved OS prediction. The SHAP analysis provided insightful interpretability, enhancing the model's clinical applicability and potential for aiding personalized treatment decisions.
Read full abstract