People with traumatic brain injury (TBI) are at high risk for infection and sepsis. The aim of the study was to develop and validate an explainable machine learning(ML) model based on clinical features for early prediction of the risk of sepsis in TBI patients. We enrolled all patients with TBI in the Medical Information Mart for Intensive Care IV database from 2008 to 2019. All patients were randomly divided into a training set (70%) and a test set (30%). The univariate and multivariate regression analyses were used for feature selection. Six ML methods were applied to develop the model. The predictive performance of different models were determined based on the area under the curve (AUC) and calibration curves in the test cohort. In addition, we selected the eICU Collaborative Research Database version 1.2 as the external validation dataset. Finally, we used the Shapley additive interpretation to account for the effects of features attributed to the model. Of the 1555 patients enrolled in the final cohort, 834 (53.6%) patients developed sepsis after TBI. Six variables were associated with concomitant sepsis and were used to develop ML models. Of the 6 models constructed, the Extreme Gradient Boosting (XGB) model achieved the best performance with an AUC of 0.807 and an accuracy of 74.5% in the internal validation cohort, and an AUC of 0.762 for the external validation. Feature importance analysis revealed that use mechanical ventilation, SAPSII score, use intravenous pressors, blood transfusion on admission, history of diabetes, and presence of post-stroke sequelae were the top six most influential features of the XGB model. As shown in the study, the ML model could be used to predict the occurrence of sepsis in patients with TBI in the intensive care unit.
Read full abstract