BackgroundGastrointestinal stromal tumors (GISTs) have malignant potential, and treatment varies according to risk. However, no specific protocols exist to preoperatively assess the malignant potential of gastric stromal tumors (gGISTs). This study aimed to use machine learning (ML) to develop and validate clinically relevant preoperative models to predict the malignant potential of gGISTs. MethodsWe screened patients diagnosed with gGISTs at the Affiliated Hospital of North Sichuan Medical College. We employed the Least Absolute Shrinkage and Selection Operator (LASSO) and logistic regression to identify risk factors. Subsequently, an ensemble of ML models was deployed to determine the optimal classifier. Additionally, we harnessed SHapley Additive exPlanations (SHAP) for tailored risk profiling. ResultsWe enrolled 318 patients with gGISTs. Utilizing LASSO regression and multifactorial logistic regression, we analyzed the training dataset, revealing that the presence of endoscopic ultrasound (EUS) high-risk features, tumor border clarity, tumor diameter, and monocyte-to-lymphocyte ratio (MLR) were significant predictors of high malignancy risk in gGIST. As determined by our ML approach, the logistic classification model demonstrated optimal performance, with an area under the receiver operating characteristic curve of 0.919 and 0.925 for the training and test sets, respectively. Furthermore, decision curve analysis substantiated the clinical relevance of the model. ConclusionHigh-risk EUS features, ill-defined tumor margins, larger tumor diameters, and elevated MLR independently predicted heightened malignant potential in gGIST. We developed logistic regression models based on these factors, which were further interpreted using the SHAP methodology. This analytical approach facilitated personalized therapeutic decision-making for diverse patient populations.
Read full abstract