Abstract

Objective: In recent decades, the Arab population has experienced an increase in the prevalence of type 2 diabetes (T2DM), particularly within the Gulf Cooperation Council. In this context, early intervention programmes rely on an ability to identify individuals at risk of T2DM. We aimed to build prognostic models for the risk of T2DM in the Arab population using machine-learning algorithms vs. conventional logistic regression (LR) and simple non-invasive clinical markers over three different time scales (3, 5, and 7 years from the baseline).Design: This retrospective cohort study used three models based on LR, k-nearest neighbours (k-NN), and support vector machines (SVM) with five-fold cross-validation. The models included the following baseline non-invasive parameters: age, sex, body mass index (BMI), pre-existing hypertension, family history of hypertension, and T2DM.Setting: This study was based on data from the Kuwait Health Network (KHN), which integrated primary health and hospital laboratory data into a single system.Participants: The study included 1,837 native Kuwaiti Arab individuals (equal proportion of men and women) with mean age as 59.5 ± 11.4 years. Among them, 647 developed T2DM within 7 years of the baseline non-invasive measurements.Analytical methods: The discriminatory power of each model for classifying people at risk of T2DM within 3, 5, or 7 years and the area under the receiver operating characteristic curve (AUC) were determined.Outcome measures: Onset of T2DM at 3, 5, and 7 years.Results: The k-NN machine-learning technique, which yielded AUC values of 0.83, 0.82, and 0.79 for 3-, 5-, and 7-year prediction horizons, respectively, outperformed the most commonly used LR method and other previously reported methods. Comparable results were achieved using the SVM and LR models with corresponding AUC values of (SVM: 0.73, LR: 0.74), (SVM: 0.68, LR: 0.72), and (SVM: 0.71, LR: 0.70) for 3-, 5-, and 7-year prediction horizons, respectively. For all models, the discriminatory power decreased as the prediction horizon increased from 3 to 7 years.Conclusions: Machine-learning techniques represent a useful addition to the commonly reported LR technique. Our prognostic models for the future risk of T2DM could be used to plan and implement early prevention programmes for at risk groups in the Arab population.

Highlights

  • During the last two decades, the Arab world, and countries in the Gulf Cooperation Council, has experienced an unprecedented increase in the prevalence of type 2 diabetes mellitus (T2DM)

  • The cohort comprised 1,837 native Kuwaiti Arab participants with a complete record of the following measurements: age at baseline, body mass index (BMI), family history of diabetes, family history of hypertension, diagnosis of hypertension, sex, and the time interval from the time of study entry to diabetes diagnosis

  • We demonstrated that prognostic models developed using six non-intrusive parameters could identify patients at a high risk of developing T2DM within 3–7 years

Read more

Summary

Introduction

During the last two decades, the Arab world, and countries in the Gulf Cooperation Council, has experienced an unprecedented increase in the prevalence of type 2 diabetes mellitus (T2DM). Previous reported trials, including studies of impaired glucose tolerance (IGT) testing and fasting biomarker levels [3], have demonstrated that lifestyle modifications or the use of medication can substantially reduce the risk of T2DM in people with IGT or elevated fasting and post-load plasma glucose concentrations [4, 5] These tests are relatively invasive, time-consuming, costly and inconvenient. Diabetes risk models based on known non-invasive risk factors and statistical analyses have been generated to identify individuals at future risk of developing T2DM [6,7,8] Such prognosis models can help to correctly identify individuals who should be targeted by intervention programmes and to avoid burdening low-risk individuals with invasive assessments, prevention, and treatment regimens. In other words, such models could improve the efficacy and costeffectiveness of T2DM prevention programmes

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call