Abstract

Diabetes, also known as Diabetic Mellitus, is a metabolic disease that affects the body's natural blood glucose levels. It is a non-contagious illness with numerous serious health risks. The said illness is rapidly growing in India. It is a chronic disorder that happens when the human body unable to create enough insulin hormone to keep blood sugar levels under control. Several characteristics that cause diabetes were investigated in this study, and multiple machine learning techniques were used to predict whether or not an unknown substance had diabetes. PIMA diabetes detection for female patients was employed for this purpose. For prediction, six distinct classification models were applied. This research presented a comprehensive performance assessment of the multiple factors in the PIMA dataset. Based on all factors of the PIMA dataset, a full discussion was made to demonstrate how diabetes is affected. Finally, in order to forecast the best automated diabetic prediction model, a thorough analysis of many classification approaches was undertaken. It was discovered that the Support Vector Machine (SVM) model delivers the best prediction result, with a reliability of 83.5 percent. Interestingly, Random Forest (RF) Classifier produced the second-best prediction result, with a reliability of 82.76 percent. The study's findings demonstrate that machine learning models produce efficient solutions. The accuracy of the two best machine learning models is 82-83 percent, which can be used for subsequent improvement of the autonomous forecasting tool. The accuracy of these techniques can be improved further by integrating diverse variables for prediction and classification.

Highlights

  • Diabetes Mellitus, known as simple diabetes, is a metabolic condition characterized by insufficient management of blood sugar levels

  • This study reveals the behavior of many crucial parameters for diabetic individuals, as well as the interaction between the primary essential components, and is entirely focused on detecting diabetics in female patients

  • The correlation matrix plot demonstrates that the dataset contains four significant predictor variables. They are as follows: 1) the number of pregnancies related with age, 2) the glucose level related with insulin, 3) the BMI

Read more

Summary

Introduction

Diabetes Mellitus, known as simple diabetes, is a metabolic condition characterized by insufficient management of blood sugar levels. Hyperglycemia, can affect the nerves, eyes, kidneys, skin, and many other organs [3,4,5] This is a highly prevalent and incurable condition. The researchers conducted a comparative investigation of various machine learning techniques in order to assess their performance [6], calculate the different performance factors from the receiver operating characteristic curves [7]. Other researchers combine the PIMA diabetics dataset with data from the Bangladesh. They compared several machine learning techniques for detecting diabetes individuals [8]. Deep learning approach performs well in predicting diabetics [9].

Dataset
Diabetes Pedigree Function
Dataset Analysis
Method
Split the Dataset
Algorithm
Discussions
Results
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call