Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records

Jun-En Ding,Yun-Chien Tseng,Chih-Ho Hsu,Fang-Ming Hung,Yi-Tui Chen,Chenwei Wu,Jian-Zhe Wang,Dongsheng Luo,Wen-Chih Peng,Chi-Te Wang,Phan Nguyen Minh Thao,Chun-Cheng Chug,Feng Liu,Pei-Fu Chen,Ling Chen,Min-Chen Hsieh

doi:10.1038/s41598-024-71020-2

Abstract

Type 2 diabetes mellitus (T2DM) is a prevalent health challenge faced by countries worldwide. In this study, we propose a novel large language multimodal models (LLMMs) framework incorporating multimodal data from clinical notes and laboratory results for diabetes risk prediction. We collected five years of electronic health records (EHRs) dating from 2017 to 2021 from a Taiwan hospital database. This dataset included 1,420,596 clinical notes, 387,392 laboratory results, and more than 1505 laboratory test items. Our method combined a text embedding encoder and multi-head attention layer to learn laboratory values, and utilized a deep neural network (DNN) module to merge blood features with chronic disease semantics into a latent space. In our experiments, we observed that integrating clinical notes with predictions based on textual laboratory values significantly enhanced the predictive capability of the unimodal model in the early detection of T2DM. Moreover, we achieved an area greater than 0.70 under the receiver operating characteristic curve (AUC) for new-onset T2DM prediction, demonstrating the effectiveness of leveraging textual laboratory data for training and inference in LLMs and improving the accuracy of new-onset diabetes prediction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Journal: Scientific Reports	Publication Date: Sep 6, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

COVID-19-Related Trends and Characteristics of Type 2 Diabetes Mellitus and Metabolic Syndrome.
Brittany N Franco ... Shinichi Asano
Cureus | VOL. 14
Brittany N Franco, et. al.Brittany N Franco ... Shinichi Asano
21 Jan 2022
Cureus | VOL. 14

Increased risk of tuberculosis in patients with type 1 diabetes mellitus: results from a population-based cohort study in Taiwan.
Te-Chun Shen ... Chang-Ching Wei
Medicine | VOL. 93
Te-Chun Shen, et. al.Te-Chun Shen ... Chang-Ching Wei
01 Oct 2014
Medicine | VOL. 93

Classification and Prediction on the Effects of Nutritional Intake on Overweight/Obesity, Dyslipidemia, Hypertension and Type 2 Diabetes Mellitus Using Deep Learning Model: 4-7th Korea National Health and Nutrition Examination Survey.
Hyerim Kim ... Yoona Kim
International journal of environmental research and public health | VOL. 18
Hyerim Kim, et. al.Hyerim Kim ... Yoona Kim
24 May 2021
International journal of environmental research and public health | VOL. 18

U-shaped association between serum IGF2BP3 and T2DM: A cross-sectional study in Chinese population.
Xiaoying Wu ... Wei Wang
Journal of Diabetes | VOL. 15
Xiaoying Wu, et. al.Xiaoying Wu ... Wei Wang
09 Mar 2023
Journal of Diabetes | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Large language multimodal models for new-onset type 2 diabetes prediction using five-year cohort electronic health records

Abstract

Talk to us

Similar Papers

More From: Scientific Reports