Abstract

BackgroundGout prediction is essential for the development of individualized prevention and treatment plans. Our objective was to develop an efficient and interpretable machine learning (ML) model using the SHapley Additive exPlanation (SHAP) to link dietary fiber and triglyceride-glucose (TyG) index to predict gout.MethodsUsing datasets from the National Health and Nutrition Examination Survey (NHANES) (2005–2018) population to study dietary fiber, the TyG index was used to predict gout. After evaluating the performance of six ML models and selecting the Light Gradient Boosting Machine (LGBM) as the optimal algorithm, we interpret the LGBM model for predicting gout using SHAP and reveal the decision-making process of the model.ResultsAn initial survey of 70,190 participants was conducted, and after a gradual exclusion process, 12,645 cases were finally included in the study. Selection of the best performing LGBM model for prediction of gout associated with dietary fiber and TyG index (Area under the ROC curve (AUC): 0.823, 95% confidence interval (CI): 0.798–0.848, Accuracy: 95.3%, Brier score: 0.077). The feature importance of SHAP values indicated that age was the most important feature affecting the model output, followed by uric acid (UA). The SHAP values showed that lower dietary fiber values had a more pronounced effect on the positive prediction of the model, while higher values of the TyG index had a more pronounced effect on the positive prediction of the model.ConclusionThe interpretable LGBM model associated with dietary fiber and TyG index showed high accuracy, efficiency, and robustness in predicting gout. Increasing dietary fiber intake and lowering the TyG index are beneficial in reducing the potential risk of gout.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call