Machine Learning Interpretability in Diabetes RiskAssessment: A SHAP Analysis

Mustafa Kutlu,Turker Berk Donmez,Chris Freeman

doi:10.69882/adba.cem.2024075

Abstract

Diabetes continues to be a complicated and prevalent metabolic illness, providing a serious burden to public health. While machine learning approaches like extreme gradient boosting (XGBoost) provide intriguing options for diabetes prediction, their 'black-box' nature typically limits clinical interpretability. To overcome this gap, our work applied SHapley Additive exPlanations (SHAP) to give insights into the XGBoost model's predictions. The dataset utilized in this research comprised of 253,680 patients and contained 21 parameters, such as General Health Status, High Blood Pressure Status, Age, and Body Mass Index. After feature selection using Recursive Feature Elimination (RFE), 15 important characteristics were discovered. In the test set, the XGBoost model obtained an accuracy of 86.6%, precision of 54.1%, recall of 17.0%, and an F1-score of 25.9% for the Original dataset. For the RFE dataset, the model displayed an accuracy of 86.6\%, precision of 54.9%, recall of 16.5%, and an F1-score of 25.3%. SHAP analysis found that General Health Status, High Blood Pressure Status, Age, and Body Mass Index were the most important characteristics in both the Original and RFE datasets. This work provides as a platform for transparent and clinically applicable predictive modeling, assisting in early diabetes identification and preventive healthcare.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Machine Learning Interpretability in Diabetes RiskAssessment: A SHAP Analysis

Abstract

Talk to us

Similar Papers

More From: Computers and Electronics in Medicine

Lead the way for us

Similar Papers

Creating machine learning models that interpretably link systemic inflammatory index, sex steroid hormones, and dietary antioxidants to identify gout using the SHAP (SHapley Additive exPlanations) method.
Shunshun Cao ... Yangyang Hu
Frontiers in Immunology | VOL. 15
Shunshun Cao, et. al.Shunshun Cao ... Yangyang Hu
01 May 2024
Frontiers in Immunology | VOL. 15

Development and validation of an interpretable machine learning for mortality prediction in patients with sepsis.
Bihua He ... Zheng Qiu
Frontiers in artificial intelligence | VOL. 7
Bihua He, et. al.Bihua He ... Zheng Qiu
08 Jul 2024
Frontiers in artificial intelligence | VOL. 7

Construction and validation of prognostic models in critically Ill patients with sepsis-associated acute kidney injury: interpretable machine learning approach
Zhiyan Fan ... Chen Xiao
Journal of Translational Medicine | VOL. 21
Zhiyan Fan, et. al.Zhiyan Fan ... Chen Xiao
22 Jun 2023
Journal of Translational Medicine | VOL. 21

Interpretable machine learning models for predicting short-term prognosis in AChR-Ab+ generalized myasthenia gravis using clinical features and systemic inflammation index.
Yanan Xu ... Liqin Luan
Frontiers in neurology | VOL. 15
Yanan Xu, et. al.Yanan Xu ... Liqin Luan
09 Oct 2024
Frontiers in neurology | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Machine Learning Interpretability in Diabetes RiskAssessment: A SHAP Analysis

Abstract

Talk to us

Similar Papers

More From: Computers and Electronics in Medicine