Abstract

AbstractOne of the most fatal and serious diseases that humans have encountered is diabetes, an illness affecting thousands of individuals yearly. In this era of digital systems, diabetes prediction based on machine learning (ML) is gaining high momentum. One of the benefits of treating patients early in the course of their noncommunicable diseases (NCDs) is that they can avoid costly therapies when the illness worsens later in life. Incidentally, diabetes is complicated by the dearth of medical professionals in underserved areas, such as distant rural communities. In these situations, the Internet of Medical Things and machine learning (ML) models can be used to offer healthcare practitioners the necessary prediction tools to more effectively and timely make decisions, thus assisting the early identification and diagnosis of NCDs. In this study, four conventional and hyper-AdaBoost ML models were trained and tested on the PIMA Indian Diabetes dataset. Patients with diabetes were classified on the basis of laboratory findings. Pre-processing tasks, such as the handling of imbalanced data and missing values, were performed prior to feature importance and normalisation activities. The algorithm with the best performance was examined using precision, accuracy, F1, recall and area under the curve metrics. Then, all ML models were hyper parametrically tuned via grid search to optimise their performance and reduce their error times. The decision process was also evaluated to further enhance the models. The AdaBoost-ET model performed even when features were not selected for binary classification. The model proposed in this study can predict diabetes with unprecedented high accuracy compared with the models in previous studies.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call