Use of Generative AI to Identify Helmet Status Among Patients With Micromobility-Related Injuries From Unstructured Clinical Notes

Kathryn G Burford, Nicole G Itzkowitz, Ashley G Ortega, Julien O Teitler, Andrew G Rundle

doi:10.1001/jamanetworkopen.2024.25981

Abstract

Large language models (LLMs) have the potential to increase the efficiency of information extraction from unstructured clinical notes in electronic medical records. The objective was to assess the utility and reliability of an LLM, ChatGPT-4 (OpenAI), in analyzing clinical narratives and identifying the helmet use status of patients injured in micromobility-related accidents. This cross-sectional study used publicly available, deidentified 2019 to 2022 data from the US Consumer Product Safety Commission's National Electronic Injury Surveillance System, a nationally representative stratified probability sample of 96 hospitals in the US. Unweighted estimates of e-bike, bicycle, hoverboard, and powered scooter-related injuries that resulted in an emergency department visit were used. Statistical analysis was performed from November 2023 to April 2024. Patient helmet status (wearing vs not wearing vs unknown) was extracted from clinical narratives using (1) a text string search based on researcher-generated text strings and (2) the LLM, prompted with low-, intermediate-, and high-detail prompts. The level of agreement between the 2 approaches was analyzed for each of the 3 prompts using Cohen κ test statistics. Fleiss κ was calculated to measure the test-retest reliability of the high-detail prompt across 5 new chat sessions and days. Performance statistics were calculated by comparing results from the high-detail prompt with classifications of helmet status generated by researchers reading the clinical notes (ie, a criterion standard review). Among 54 569 clinical notes, moderate (Cohen κ = 0.74 [95% CI, 0.73-0.75]) and weak (Cohen κ = 0.53 [95% CI, 0.52-0.54]) agreement were found between the text string-search approach and the LLM for the low- and intermediate-detail prompts, respectively. The high-detail prompt had almost perfect agreement (κ = 1.00 [95% CI, 1.00-1.00]) but required the greatest amount of time to complete. The LLM did not perfectly replicate its analyses across new sessions and days (Fleiss κ = 0.91 across 5 trials; P < .001). The LLM often hallucinated and was consistent in replicating its hallucinations. It also showed high validity compared with the criterion standard (n = 400; κ = 0.98 [95% CI, 0.96-1.00]). This study's findings suggest that although there are efficiency gains for using the LLM to extract information from clinical notes, the inadequate reliability compared with a text string-search approach, hallucinations, and inconsistent performance significantly hinder the potential of the currently available LLM.
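The comparison described in the abstract can be illustrated with a minimal Python sketch: a text string search that assigns each note to wearing, not wearing, or unknown, followed by an unweighted Cohen κ between two sets of labels. Everything specific here is an assumption made for illustration. The search patterns, the example notes, and the second set of labels are hypothetical and are not the researcher-generated strings, the study data, or the LLM output, none of which appear in the abstract.

```python
import re
from collections import Counter

# Hypothetical search strings, for illustration only; the study's actual
# researcher-generated strings are not listed in the abstract.
NOT_WEARING = re.compile(
    r"\b(?:no|not wearing(?: a)?|without(?: a)?)\s+helmet\b|\bunhelmeted\b",
    re.IGNORECASE,
)
WEARING = re.compile(
    r"\b(?:wearing(?: a)?|with(?: a)?)\s+helmet\b|\bhelmeted\b",
    re.IGNORECASE,
)


def classify_helmet(note: str) -> str:
    """Label a clinical narrative as 'wearing', 'not wearing', or 'unknown'."""
    # Negated phrases are checked first so that "not wearing helmet"
    # is not captured by the "wearing helmet" pattern.
    if NOT_WEARING.search(note):
        return "not wearing"
    if WEARING.search(note):
        return "wearing"
    return "unknown"


def cohen_kappa(labels_a, labels_b) -> float:
    """Unweighted Cohen kappa for two equally long sequences of labels."""
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("label sequences must be non-empty and equally long")
    n = len(labels_a)
    # Observed agreement: share of notes given the same label by both methods.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: product of the marginal proportions, summed over labels.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(
        count_a[k] * count_b[k] for k in count_a.keys() | count_b.keys()
    ) / (n * n)
    if expected == 1.0:
        # Both methods used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Made-up narratives in the style of injury surveillance notes.
notes = [
    "12YOM FELL OFF BICYCLE, NO HELMET, DX: HEAD LACERATION",
    "34YOF RIDING E-BIKE WEARING HELMET, STRUCK BY CAR",
    "20YOM FELL FROM SCOOTER, DX: WRIST FRACTURE",
    "UNHELMETED RIDER FELL FROM HOVERBOARD",
]
string_search_labels = [classify_helmet(note) for note in notes]

# Hypothetical labels standing in for a second method (for example, an LLM).
second_method_labels = ["not wearing", "wearing", "unknown", "unknown"]

print(string_search_labels)
print(round(cohen_kappa(string_search_labels, second_method_labels), 2))
```

In this toy example the two methods agree on 3 of 4 notes, and κ discounts the agreement expected by chance from the label frequencies. The test-retest reliability across 5 sessions reported in the abstract uses Fleiss κ, which generalizes this idea to more than 2 sets of labels and is not shown here.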

Journal: JAMA Network Open	Publication Date: Aug 1, 2024
Citations: 1	License type: cc-by
