Artificial intelligence as a modality to enhance the readability of neurosurgical literature for patients.

Gage A Guerra,Sophie Grove,Jonathan Le,Hayden L Hofmann,Ishan Shah,Sweta Bhagavatula,Benjamin Fixman,David Gomez,Benjamin Hopkins,Jonathan Dallas,Giovanni Cacciamani,Racheal Peterson,Gabriel Zada

doi:10.3171/2024.6.jns24617

Abstract

In this study the authors assessed the ability of Chat Generative Pretrained Transformer (ChatGPT) 3.5 and ChatGPT4 to generate readable and accurate summaries of published neurosurgical literature. Abstracts published in journal issues released between June 2023 and August 2023 (n = 150) were randomly selected from the top 5 ranked neurosurgical journals according to Google Scholar. ChatGPT models were instructed to generate a readable layperson summary of the original abstract from a statistically validated prompt. Readability results and grade-level indicators (RR-GLIs) scores were calculated for GPT3.5- and GPT4-generated summaries and original abstracts. Two physicians independently rated the accuracy of ChatGPT-generated layperson summaries to assess scientific validity. One-way ANOVA followed by pairwise t-test with Bonferroni correction were performed to compare readability scores. Cohen's kappa was used to assess interrater agreement between the two rater physicians. Analysis of 150 original abstracts showed a statistically significant difference for all RR-GLIs between the ChatGPT-generated summaries and original abstracts. The readability scores are formatted as follows (original abstract mean, GPT3.5 summary mean, GPT4 summary mean, p value): Flesch-Kincaid reading grade (12.55, 7.80, 7.70, p < 0.0001); Gunning fog score (15.46, 10.00, 9.00, p < 0.0001); Simple Measure of Gobbledygook (SMOG) index (11.30, 7.13, 6.60, p < 0.0001); Coleman-Liau index (14.67, 11.32, 10.26, p < 0.0001); automated readability index (10.87, 8.50, 7.75, p < 0.0001); and Flesch-Kincaid reading ease (33.29, 68.45, 69.55, p < 0.0001). GPT4-generated summaries demonstrated higher RR-GLIs than GPT3.5-generated summaries in the following categories: Gunning fog score (0.0003); SMOG index (0.027); Coleman-Liau index (< 0.0001); sentences (< 0.0001); complex words (< 0.0001); and % complex words (0.0035). A total of 68.4% and 84.2% of GPT3.5- and GPT4-generated summaries, respectively, maintained moderate scientific accuracy according to the two physician-reviewers. The findings demonstrate promising potential for application of the ChatGPT in patient education. GPT4 is an accessible tool that can be an immediate solution to enhancing the readability of current neurosurgical literature. Layperson summaries generated by GPT4 would be a valuable addition to a neurosurgical journal and would be likely to improve comprehension for patients using internet resources like PubMed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Artificial intelligence as a modality to enhance the readability of neurosurgical literature for patients.

Abstract

Talk to us

Similar Papers

More From: Journal of neurosurgery

Lead the way for us

Similar Papers

Dyslexia Articles Unboxed: Analyzing Their Readability Level.
Yusuke Matsuura ... Chung Jaeah
Journal of developmental and behavioral pediatrics : JDBP | VOL. 45
Yusuke Matsuura, et. al.Yusuke Matsuura ... Chung Jaeah
13 Jun 2024
Journal of developmental and behavioral pediatrics : JDBP | VOL. 45

Semantics Matter: Cheiloschisis Web-Based Information Differs from Cleft Lip
Darren B Abbas ... Jennifer B L Parker
Journal of the American College of Surgeons | VOL. 235
Darren B Abbas, et. al.Darren B Abbas ... Jennifer B L Parker
17 Oct 2022
Journal of the American College of Surgeons | VOL. 235

Readability and Quality of Online Patient Education Materials Concerning Posterior Cruciate Ligament Reconstruction.
Michele Venosa ... Emilio Romanini
Cureus | VOL. 16
Michele Venosa, et. al.Michele Venosa ... Emilio Romanini
01 Apr 2024
Cureus | VOL. 16

POS1458 HOW EASY IS IT FOR PATIENTS TO READ AND UNDERSTAND AVAILABLE PATIENT EDUCATIONAL MATERIALS FOR LUPUS?
U C Nweke ... J Meenakshi
Annals of the Rheumatic Diseases | VOL. 81
U C Nweke, et. al.U C Nweke ... J Meenakshi
23 May 2022
POS1458 HOW EASY IS IT FOR PATIENTS TO READ AND UNDERSTAND AVAILABLE PATIENT EDUCATIONAL MATERIALS FOR LUPUS?
U C Nweke ... J Meenakshi

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Artificial intelligence as a modality to enhance the readability of neurosurgical literature for patients.

Abstract

Talk to us

Similar Papers

More From: Journal of neurosurgery