Use of a Large Language Model to Assess Clinical Acuity of Adults in the Emergency Department

Christopher Y K Williams,Travis Zack,Brenda Y Miao,Madhumita Sushil,Michelle Wang,Aaron E Kornblith,Atul J Butte

doi:10.1001/jamanetworkopen.2024.8895

Christopher Y K Williams, Travis Zack + Show 5 more

Open Access

https://doi.org/10.1001/jamanetworkopen.2024.8895

Copy DOI

Journal: JAMA Network Open	Publication Date: May 7, 2024
Citations: 15	License type: cc-by

Affiliation: University of California, San Francisco

Abstract

The introduction of large language models (LLMs), such as Generative Pre-trained Transformer 4 (GPT-4; OpenAI), has generated significant interest in health care, yet studies evaluating their performance in a clinical setting are lacking. Determination of clinical acuity, a measure of a patient's illness severity and level of required medical attention, is one of the foundational elements of medical reasoning in emergency medicine. To determine whether an LLM can accurately assess clinical acuity in the emergency department (ED). This cross-sectional study identified all adult ED visits from January 1, 2012, to January 17, 2023, at the University of California, San Francisco, with a documented Emergency Severity Index (ESI) acuity level (immediate, emergent, urgent, less urgent, or nonurgent) and with a corresponding ED physician note. A sample of 10 000 pairs of ED visits with nonequivalent ESI scores, balanced for each of the 10 possible pairs of 5 ESI scores, was selected at random. The potential of the LLM to classify acuity levels of patients in the ED based on the ESI across 10 000 patient pairs. Using deidentified clinical text, the LLM was queried to identify the patient with a higher-acuity presentation within each pair based on the patients' clinical history. An earlier LLM was queried to allow comparison with this model. Accuracy score was calculated to evaluate the performance of both LLMs across the 10 000-pair sample. A 500-pair subsample was manually classified by a physician reviewer to compare performance between the LLMs and human classification. From a total of 251 401 adult ED visits, a balanced sample of 10 000 patient pairs was created wherein each pair comprised patients with disparate ESI acuity scores. Across this sample, the LLM correctly inferred the patient with higher acuity for 8940 of 10 000 pairs (accuracy, 0.89 [95% CI, 0.89-0.90]). Performance of the comparator LLM (accuracy, 0.84 [95% CI, 0.83-0.84]) was below that of its successor. Among the 500-pair subsample that was also manually classified, LLM performance (accuracy, 0.88 [95% CI, 0.86-0.91]) was comparable with that of the physician reviewer (accuracy, 0.86 [95% CI, 0.83-0.89]). In this cross-sectional study of 10 000 pairs of ED visits, the LLM accurately identified the patient with higher acuity when given pairs of presenting histories extracted from patients' first ED documentation. These findings suggest that the integration of an LLM into ED workflows could enhance triage processes while maintaining triage quality and warrants further investigation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Use of a Large Language Model to Assess Clinical Acuity of Adults in the Emergency Department

Abstract

Talk to us

Similar Papers

More From: JAMA Network Open

Lead the way for us

Similar Papers

Evaluating the Performance of Large Language Models in Hematopoietic Stem Cell Transplantation Decision Making
Ivan Civettini ... Paola Perfetti
Blood | VOL. 142
Ivan Civettini, et. al.Ivan Civettini ... Paola Perfetti
02 Nov 2023
Blood | VOL. 142

A Clinical Nurse Specialist-Driven Project to Improve Emergency Department Triage Accuracy.
Denise Campbell ... Lisa Fetters
Clinical Nurse Specialist | VOL. 36
Denise Campbell, et. al.Denise Campbell ... Lisa Fetters
01 Jan 2021
Clinical Nurse Specialist | VOL. 36

The burden of cannabis-attributed pediatric and adult emergency department visits.
Robert G Hendrickson ... John A Thompson
Academic emergency medicine : official journal of the Society for Academic Emergency Medicine | VOL. 28
Robert G Hendrickson, et. al.Robert G Hendrickson ... John A Thompson
04 Jul 2021
Academic emergency medicine : official journal of the Society for Academic Emergency Medicine | VOL. 28

260: Effect of Emergency Department Crowding on Generation of Cases for Performance Improvement Review
K Stransky ... S Mullenix
Annals of Emergency Medicine | VOL. 56
K Stransky, et. al.K Stransky ... S Mullenix
25 Aug 2010
Annals of Emergency Medicine | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Use of a Large Language Model to Assess Clinical Acuity of Adults in the Emergency Department

Abstract

Talk to us

Similar Papers

More From: JAMA Network Open