Abstract

ObjectiveTo develop a machine-learning model that can predict the risk of pancreatic ductal adenocarcinoma (PDAC) in people with new-onset diabetes (NOD). MethodsFrom a population-based sample of individuals with NOD aged >50 years, patients with pancreatic cancer-related diabetes (PCRD), defined as NOD followed by a PDAC diagnosis within 3 years, were included (n = 716). These PCRD patients were randomly matched in a 1:1 ratio with individuals having NOD. Data from Danish national health registries were used to develop a random forest model to distinguish PCRD from Type 2 diabetes. The model was based on age, gender, and parameters derived from feature engineering on trajectories of routine biochemical variables. Model performance was evaluated using receiver operating characteristic curves (ROC) and relative risk scores. ResultsThe most discriminative model included 20 features and achieved a ROC-AUC of 0.78 (CI:0.75–0.83). Compared to the general NOD population, the relative risk for PCRD was 20-fold increase for the 1 % of patients predicted by the model to have the highest cancer risk (3-year cancer risk of 12 % and sensitivity of 20 %). Age was the most discriminative single feature, followed by the rate of change in haemoglobin A1c and the latest plasma triglyceride level. When the prediction model was restricted to patients with PDAC diagnosed six months after diabetes diagnosis, the ROC-AUC was 0.74 (CI:0.69–0.79). ConclusionIn a population-based setting, a machine-learning model utilising information on age, sex and trajectories of routine biochemical variables demonstrated good discriminative ability between PCRD and Type 2 diabetes.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.