Translational Science Award Research Articles

Accurate medical coding is essential for clinical and administrative purposes but complicated, time-consuming, and biased. This study compares Retrieval-Augmented Generation (RAG)-enhanced LLMs to provider-assigned codes in producing ICD-10-CM codes from emergency department (ED) clinical records. Retrospective cohort study using 500 ED visits randomly selected from the Mount Sinai Health System between January and April 2024. The RAG system integrated past 1,038,066 ED visits data (2021-2023) into the LLMs' predictions to improve coding accuracy. Nine commercial and open-source LLMs were evaluated. The primary outcome was a head-to-head comparison of the ICD-10-CM codes generated by the RAG-enhanced LLMs and those assigned by the original providers. A panel of four physicians and two LLMs blindly reviewed the codes, comparing the RAG-enhanced LLM and provider-assigned codes on accuracy and specificity. RAG-enhanced LLMs demonstrated superior performance to provider coders in both the accuracy and specificity of code assignments. In a targeted evaluation of 200 cases where discrepancies existed between GPT-4 and provider-assigned codes, human reviewers favored GPT-4 for accuracy in 447 instances, compared to 277 instances where providers' codes were preferred (p<0.001). Similarly, GPT-4 was selected for its superior specificity in 509 cases, whereas human coders were preferred in only 181 cases (p<0.001). Smaller open-access models, such as Llama-3.1-70B, also demonstrated substantial scalability when enhanced with RAG, with 218 instances of accuracy preference compared to 90 for providers' codes. Furthermore, across all models, the exact match rate between LLM-generated and provider-assigned codes significantly improved following RAG integration, with Qwen-2-7B increasing from 0.8% to 17.6% and Gemma-2-9b-it improving from 7.2% to 26.4%. RAG-enhanced LLMs improve medical coding accuracy in EDs, suggesting clinical workflow applications. These findings show that generative AI can improve clinical outcomes and reduce administrative burdens. This work was supported in part through the computational and data resources and staff expertise provided by Scientific Computing and Data at the Icahn School of Medicine at Mount Sinai and supported by the Clinical and Translational Science Awards (CTSA) grant UL1TR004419 from the National Center for Advancing Translational Sciences. Research reported in this publication was also supported by the Office of Research Infrastructure of the National Institutes of Health under award number S10OD026880 and S10OD030463. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The funders played no role in study design, data collection, analysis and interpretation of data, or the writing of this manuscript. A study showed AI models with retrieval-augmented generation outperformed human doctors in ED diagnostic coding accuracy and specificity. Even smaller AI models perform favorably when using RAG. This suggests potential for reducing administrative burden in healthcare, improving coding efficiency, and enhancing clinical documentation.

One in three adults in the United States suffers from metabolic syndrome (MetSyn), a state of insulin resistance associated with reduced cerebral blood flow (CBF) and increased risk of stroke and neurologic diseases. Notably, females are at greater risk for cardiovascular and metabolic complications derived from insulin resistance. Still, it remains unclear whether the reduction in CBF seen in MetSyn is sex-specific or uniform between various brain regions. Additionally, fasting triglyceride (TG) levels are stronger predictors of cognitive disease in females than in males. However, little data exist examining the relationship between TG and CBF in females with MetSyn. Therefore, we aimed to examine the interaction between sex and MetSyn on global and regional CBF in young adults and to explore the relationship between fasting TG levels and CBF. We predict that males and females with MetSyn will have lower global and regional CBF compared to healthy controls of the same sex, but to a greater extent in females with MetSyn. Additionally, we predict fasting TG levels will have stronger relationships with CBF in females than in males with MetSyn. Twenty-five healthy controls (CON; 7 females; 24 ± 4 y) and 13 individuals with MetSyn (5 females; 27 ± 8 y) completed this study. Subjects were unmedicated, and females were studied on days 1-5 of their menstrual cycle. A metabolic panel, blood pressure, and waist circumference were collected in a fasted state. Magnetic resonance imaging (MRI, 3 Tesla) with arterial spin labeling was used to quantify global and regional CBF. Two-way (Group x Sex) linear models with Shaffer’s correction and Tukey adjusted post hocs were used to determine differences in CBF. In the MetSyn groups, Pearson correlation coeffcients were assessed between fasting TG levels and each global and regional CBF value. Global CBF was greater in female CON compared to all groups (p ≤ 0.01), and male CON had greater CBF than male MetSyn (p = 0.03), whereas there was no difference between male and female MetSyn (p = 0.43). Regional analyses resulted in the same pattern of between-group differences in the parietal lobe (p = 0.03), the precuneus (p = 0.01), and the frontal lobe (p = 0.01), outside of a nonsignificant difference between male CON and male MetSyn in the frontal lobe (p = 0.12). No interactions existed for the temporal lobe, occipital lobe, or thalamus (p ≥ 0.06). In females with MetSyn, fasting TG levels were moderately negatively correlated with CBF (r = -0.65), whereas a moderate positive relationship existed in males with MetSyn (r = 0.71). These data suggest that MetSyn reduces CBF to a greater extent in females than males, abolishing sex-dependent differences in CBF present in healthy young adults. While female sex is protective against cardiovascular disease in healthy young adults, this is not the case in individuals with MetSyn. Additionally, males and females with MetSyn had opposing relationships between fasting TG levels and CBF. These findings may help explain the greater cardiovascular risk in females with insulin resistance. American Diabetes Association (ADA1-16-ICTS-099 and ADA 1-12-IN-39); Clinical and Translational Science Award program (UL1TR002373); Wisconsin National Primate Research Center (P51OD011106). This is the full abstract presented at the American Physiology Summit 2024 meeting and is only available in HTML format. There are no additional versions or additional content available for this abstract. Physiology was not involved in the peer review process.

Translational Science Award Research Articles

Related Topics

Articles published on Translational Science Award

Assessing Retrieval-Augmented Large Language Model Performance in Emergency Department ICD-10-CM Coding Compared to Human Coders.

The intersection of Interprofessional Education and One Health: A qualitative study in human and veterinary medical institutions

Understanding enterprise data warehouses to support clinical and translational research: impact, sustainability, demand management, and accessibility.

Title: Influence of the NASA SPRINT exercise protocol on myonuclear and satellite cell content of the vastus lateralis and soleus during 70 days of bedrest

Fiber type specific myonuclear and satellite cell content comparisons between the vastus lateralis and soleus muscles in humans

Metabolic syndrome reduces cerebral blood flow more in women than men

The uses of 3-dimensional printing technology in orthodontic offices in North America

Gut microbiome correlates of recurrent urinary tract infection: a longitudinal, multi-center study

Altered transcriptomic immune responses of maintenance hemodialysis patients to the COVID-19 mRNA vaccine.

531 Transforming a Pilot Grant Program to Advance Clinical & Translational Science

154 Leveraging Implementation Science Competencies to Establish a D&I Science Core

559 Ideation Jams: Catalyzing Interdisciplinary Teams to Maximize Research Impact

555 Design Lab Methodology Supports Innovation in Clinical Trials

99 Using LGBTQ+ Community Expertise to Co-Develop Inclusive Sexual Orientation and Gender Identity (SOGI) Screening for Research Studies

Implementation and Evaluation of a National Well-Being Curriculum for KL2 Scholars.

Development of TRACER: A Translational Research Accomplishments Cataloguerfor Clinical and Translational Science Award hub activity tracking, evaluation, and decision-making.

Translation in action: Influence, collaboration, and evolution of COVID-19 research with Clinical and Translational Science Awards consortium support.

A national unmet needs assessment for CTSA-affiliated electronic health record data networks: A customer discovery approach.

A landscape assessment of CTSA evaluators and their work in the CTSA consortium, 2021 survey findings.

Developing a shared understanding of translational science within CTSA hubs through facilitated retreats: A case study.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Translational Science Award Research Articles

Related Topics

Articles published on Translational Science Award

Assessing Retrieval-Augmented Large Language Model Performance in Emergency Department ICD-10-CM Coding Compared to Human Coders.

The intersection of Interprofessional Education and One Health: A qualitative study in human and veterinary medical institutions

Understanding enterprise data warehouses to support clinical and translational research: impact, sustainability, demand management, and accessibility.

Title: Influence of the NASA SPRINT exercise protocol on myonuclear and satellite cell content of the vastus lateralis and soleus during 70 days of bedrest

Fiber type specific myonuclear and satellite cell content comparisons between the vastus lateralis and soleus muscles in humans

Metabolic syndrome reduces cerebral blood flow more in women than men

The uses of 3-dimensional printing technology in orthodontic offices in North America

Gut microbiome correlates of recurrent urinary tract infection: a longitudinal, multi-center study

Altered transcriptomic immune responses of maintenance hemodialysis patients to the COVID-19 mRNA vaccine.

531 Transforming a Pilot Grant Program to Advance Clinical &amp; Translational Science

154 Leveraging Implementation Science Competencies to Establish a D&amp;I Science Core

559 Ideation Jams: Catalyzing Interdisciplinary Teams to Maximize Research Impact

555 Design Lab Methodology Supports Innovation in Clinical Trials

99 Using LGBTQ+ Community Expertise to Co-Develop Inclusive Sexual Orientation and Gender Identity (SOGI) Screening for Research Studies

Implementation and Evaluation of a National Well-Being Curriculum for KL2 Scholars.

Development of TRACER: A Translational Research Accomplishments Cataloguerfor Clinical and Translational Science Award hub activity tracking, evaluation, and decision-making.

Translation in action: Influence, collaboration, and evolution of COVID-19 research with Clinical and Translational Science Awards consortium support.

A national unmet needs assessment for CTSA-affiliated electronic health record data networks: A customer discovery approach.

A landscape assessment of CTSA evaluators and their work in the CTSA consortium, 2021 survey findings.

Developing a shared understanding of translational science within CTSA hubs through facilitated retreats: A case study.

531 Transforming a Pilot Grant Program to Advance Clinical & Translational Science

154 Leveraging Implementation Science Competencies to Establish a D&I Science Core