Exploring the Feasibility of GPT-4 as a Data Extraction Tool for Renal Surgery Operative Notes.

Jessica Y Hsueh,Daniel Nethala,Shiva Singh,Jason A Hyman,David G Gelikman,W Marston Linehan,Mark W Ball

doi:10.1097/upj.0000000000000599

Abstract

GPT-4 is a large language model with potential for multiple applications in urology. Our study sought to evaluate GPT-4's performance in data extraction from renal surgery operative notes. GPT-4 was queried to extract information on laterality, surgery, approach, estimated blood loss, and ischemia time from deidentified operative notes. Match rates were determined by the number of "matched" data points between GPT-4 and human-curated extraction. Accuracy rates were calculated after manually reviewing "not matched" data points. Cohen's kappa and the intraclass coefficient were used to evaluate interrater agreement/reliability. Our cohort consisted of 1498 renal surgeries from 2003 to 2023. Match rates were high for laterality (94.4%), surgery (92.5%), and approach (89.4%), but lower for estimated blood loss (77.1%) and ischemia time (25.6%). GPT-4 was more accurate for estimated blood loss (90.3% vs 85.5% human curated) and similarly accurate for laterality (95.2% vs 95.3% human curated). Human-curated accuracy rates were higher for surgery (99.3% vs 93% GPT-4), approach (97.9% vs 90.8% GPT-4), and ischemia time (95.6% vs 30.7% GPT-4). Cohen's kappa was 0.96 for laterality, 0.83 for approach, and 0.71 for surgery. The intraclass coefficient was 0.62 for estimated blood loss and 0.09 for ischemia time. Match and accuracy rates were higher for categorical variables. GPT-4 data extraction was particularly error prone for variables with heterogenous documentation styles. The role of a standard operative template to aid data extraction will be explored in the future. GPT-4 can be utilized as a helpful and efficient data extraction tool with manual feedback.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring the Feasibility of GPT-4 as a Data Extraction Tool for Renal Surgery Operative Notes.

Abstract

Talk to us

Similar Papers

More From: Urology practice

Lead the way for us

Similar Papers

Investigating the clinical reasoning abilities of large language model GPT-4: an analysis of postoperative complications from renal surgeries
Jessica Y Hsueh ... Mark W Ball
Urologic Oncology: Seminars and Original Investigations | VOL. 42
Jessica Y Hsueh, et. al.Jessica Y Hsueh ... Mark W Ball
06 May 2024
Urologic Oncology: Seminars and Original Investigations | VOL. 42

Automating Systematic Literature Reviews with Retrieval-Augmented Generation: A Comprehensive Overview
Binglan Han ... Anuradha Mathrani
Applied Sciences | VOL. 14
Binglan Han, et. al.Binglan Han ... Anuradha Mathrani
09 Oct 2024
Applied Sciences | VOL. 14

Performance of two large language models for data extraction in evidence synthesis.
Amanda Konet ... Robert Chew
Research synthesis methods | VOL. -
Amanda Konet, et. al.Amanda Konet ... Robert Chew
19 Jun 2024
Research synthesis methods | VOL. -

Targeted Training Data Extraction—Neighborhood Comparison-Based Membership Inference Attacks in Large Language Models
Huan Xu ... Yingbo Wu
Applied Sciences | VOL. 14
Huan Xu, et. al.Huan Xu ... Yingbo Wu
14 Aug 2024
Applied Sciences | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring the Feasibility of GPT-4 as a Data Extraction Tool for Renal Surgery Operative Notes.

Abstract

Talk to us

Similar Papers

More From: Urology practice