Abstract
This study aims to develop an optimal machine learning model that uses lung equivalent uniform dose (lung EUD to predict radiation pneumonitis (RP) occurrence in lung cancer patients treated with volumetric modulated arc therapy (VMAT). We analyzed a cohort of 77 patients diagnosed with locally advanced squamous cell lung cancer (LASCLC) receiving concurrent chemoradiotherapy with VMAT. Patients were categorized based on the onset of grade II or higher radiation pneumonitis (RP 2+). Dose volume histogram data, extracted from the treatment planning system, were used to compute the lung EUD values for both groups using a specialized numerical analysis code. We identified the parameter α, representing the most significant relative difference in lung EUD between the two groups. The predictive potential of variables for RP2+, including physical dose metrics, lung EUD, normal tissue complication probability (NTCP) from the Lyman-Kutcher-Burman (LKB) model, and lung EUD-calibrated NTCP for affected and whole lung, underwent both univariate and multivariate analyses. Relevant variables were then employed as inputs for machine learning models: multiple logistic regression (MLR), support vector machine (SVM), decision tree (DT), and K-nearest neighbor (KNN). Each model's performance was gauged using the area under the curve (AUC), determining the best-performing model. The optimal α-value for lung EUD was 0.3, maximizing the relative lung EUD difference between the RP 2+ and non-RP 2+ groups. A strong correlation coefficient of 0.929 (P< 0.01) was observed between lung EUD (α = 0.3) and physical dose metrics. When examining predictive capabilities, lung EUD-based NTCP for the affected lung (AUC: 0.862) and whole lung (AUC: 0.815) surpassed LKB-based NTCP for the respective lungs. The decision tree (DT) model using lung EUD-based predictors emerged as the superior model, achieving an AUC of 0.98 in both training and validation datasets. The likelihood of developing RP 2+ has shown a significant correlation with the advancements in RT technology. From traditional 3-D conformal RT, lung cancer treatment methodologies have transitioned to sophisticated techniques like static IMRT. Accurately deriving such a dose-effect relationship through NTCP modeling of RP incidence is statistically challenging due to the increased number of degrees-of-freedom. To the best of our knowledge, many studies have not clarified the rationale behind setting the α-value to 0.99 or 1, despite the closely aligned calculated lung EUD and lung mean dose MLD. Perfect independence among variables is rarely achievable in real-world scenarios. Four prominent machine learning algorithms were used to devise our prediction models. The inclusion of lung EUD-based factors substantially enhanced their predictive performance for RP 2+. Our results advocate for the decision tree model with lung EUD-based predictors as the optimal prediction tool for VMAT-treated lung cancer patients. Which could replace conventional dosimetric parameters, potentially simplifying complex neural network structures in prediction models.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have