Abstract
Background and aimIt is crucial to identify a diabetes diagnosis early. Create a predictive model utilizing machine learning (ML) to identify new cases of diabetes in primary health care (PHC). MethodsA case-control study utilizing data on PHC visits for sex-, age, and PHC-matched controls. Stochastic gradient boosting was used to construct a model for predicting cases of diabetes based on diagnostic codes from PHC consultations during the year before index (diagnosis) date and number of consultations. Variable importance was estimated using the normalized relative influence (NRI) score. Risks of having diabetes were calculated using odds ratios of marginal effects (ORME). Four groups by age and sex were studied, age-groups 35–64 years and ≥ 65 years in men and women, respectively. ResultsThe most important predictive factors were hypertension with NRI 21.4–29.7 %, and obesity 4.8–15.2 %. The NRI for other top ten diagnoses and administrative codes generally ranged 1.0–4.2 %. ConclusionsOur data confirm the known risk patterns for predicting a new diagnosis of diabetes, and the need to test blood glucose frequently. To assess the full potential of ML for risk prediction purposes in clinical practice, future studies could include clinical data on life-style patterns, laboratory tests and prescribed medication.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have