Abstract

Uropathy is a serious chronic disease whose prevalence is increasing at an alarming rate. Early detection and prediction of diabetes in women is important because of the increased risk of diabetes-related complications during pregnancy. This study introduces machine learning models to assess the likelihood of diabetes in women. The importance of studying characteristics and improving prediction accuracy to understand the nuances of categorization. Specifically, for data preprocessing, experiments are conducted to solve the problem of missing values and outliers by replacing the zero values of certain features with the median values of the corresponding features. This step reduces the impact of less reliable data on model performance. As recognition models, Gaussian Naive Bayes (GNB), Support Vector Machine (SVM), and Random Forest (RF) are built. Performance analysis is performed along with a careful exploration of the hyperparameter space. Scores for Receiver Operating Characteristic - Area Under the Curve (ROC-AUC) are used to compare various models. Different features affect the classification to different degrees. The experimental findings indicate that the modified random forest model demonstrates superior prediction accuracy and robustness. These findings can assist physicians in predicting a patient's risk of developing diabetes earlier.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call