Artificial Intelligence–Generated Draft Replies to Patient Inbox Messages

Anna Devon-Sand, Carlene Lugtu, Christopher Sharp, Danyelle Clutter, Kevin Takazawa, Kyle Vogt, Margaret Smith, Matthew Rojo, Michael A Pfeffer, Ming Tai-Seale, Patricia Garcia, Shreya Shah, Stephen P Ma, Steven Lin, Tait Shanafelt, Yejin Jeong

doi:10.1001/jamanetworkopen.2024.3201

Open Access

https://doi.org/10.1001/jamanetworkopen.2024.3201

Journal: JAMA Network Open
Publication Date: Mar 20, 2024
Citations: 20
License type: cc-by

Affiliation: Stanford University

Abstract

Importance: The emergence and promise of generative artificial intelligence (AI) represent a turning point for health care. Rigorous evaluation of generative AI deployment in clinical practice is needed to inform strategic decision-making.

Objective: To evaluate the implementation of a large language model used to draft responses to patient messages in the electronic inbox.

Design, Setting, and Participants: A 5-week, prospective, single-group quality improvement study was conducted from July 10 through August 13, 2023, at a single academic medical center (Stanford Health Care). All attending physicians, advanced practice practitioners (APPs), clinic nurses, and clinical pharmacists from the Divisions of Primary Care and Gastroenterology and Hepatology were enrolled in the pilot.

Exposure: Draft replies to patient portal messages generated by a Health Insurance Portability and Accountability Act-compliant electronic health record-integrated large language model.

Main Outcomes and Measures: The primary outcome was AI-generated draft reply utilization as a percentage of total patient message replies. Secondary outcomes included changes in time measures and clinician experience as assessed by survey.

Results: A total of 197 clinicians were enrolled in the pilot; 35 clinicians who were prepilot beta users, out of office, or not tied to a specific ambulatory clinic were excluded, leaving 162 clinicians included in the analysis. The survey analysis cohort consisted of 73 participants (45.1%) who completed both the presurvey and postsurvey. In gastroenterology and hepatology, there were 58 physicians and APPs and 10 nurses. In primary care, there were 83 physicians and APPs, 4 nurses, and 8 clinical pharmacists. The mean AI-generated draft response utilization rate across clinicians was 20%. There was no change in reply action time, write time, or read time between the prepilot and pilot periods. There were statistically significant reductions in the 4-item physician task load score derivative (mean [SD], 61.31 [17.23] presurvey vs 47.26 [17.11] postsurvey; paired difference, -13.87; 95% CI, -17.38 to -9.50; P < .001) and work exhaustion scores (mean [SD], 1.95 [0.79] presurvey vs 1.62 [0.68] postsurvey; paired difference, -0.33; 95% CI, -0.50 to -0.17; P < .001).

Conclusions and Relevance: In this quality improvement study of an early implementation of generative AI, there was notable adoption, usability, and improvement in assessments of burden and burnout. There was no improvement in time. Further code-to-bedside testing is needed to guide future development and organizational strategy.
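The cohort arithmetic and the primary outcome described in the abstract can be illustrated with a minimal Python sketch. The cohort figures (197, 35, 73) come from the abstract; the per-clinician reply counts are hypothetical and chosen only for illustration. The sketch assumes that "mean utilization rate across clinicians" means the unweighted mean of per-clinician percentages, which the abstract does not state explicitly, and it shows a pooled rate alongside for contrast.

```python
from statistics import mean

# Cohort arithmetic taken from the abstract.
enrolled = 197
excluded = 35
analyzed = enrolled - excluded                       # clinicians in the analysis
survey_completers = 73
survey_pct = round(100 * survey_completers / analyzed, 1)


def utilization_rate(drafts_used: int, total_replies: int) -> float:
    """Percentage of one clinician's replies that used an AI-generated draft."""
    if total_replies <= 0:
        raise ValueError("total_replies must be positive")
    return 100.0 * drafts_used / total_replies


def mean_utilization(counts: list[tuple[int, int]]) -> float:
    """Unweighted mean of per-clinician rates (assumed definition)."""
    return mean(utilization_rate(used, total) for used, total in counts)


def pooled_utilization(counts: list[tuple[int, int]]) -> float:
    """Rate over all replies pooled together, shown only for contrast."""
    used = sum(u for u, _ in counts)
    total = sum(t for _, t in counts)
    return 100.0 * used / total


# Hypothetical (drafts_used, total_replies) pairs, NOT study data.
example_counts = [(10, 50), (5, 50), (30, 100)]

if __name__ == "__main__":
    print(f"Analyzed clinicians: {analyzed}")
    print(f"Survey cohort: {survey_pct}% of analyzed clinicians")
    print(f"Mean per-clinician utilization: {mean_utilization(example_counts):.1f}%")
    print(f"Pooled utilization: {pooled_utilization(example_counts):.1f}%")
```

The two summary rates differ whenever clinicians send different numbers of replies, so which definition a study uses affects how a figure such as 20% should be read.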
