Lumbar disc herniation with radiculopathy: a comparison of NASS guidelines and ChatGPT

Ankur Kayastha,Kirthika Lakshmanan,Michael J Valentine,Anh Nguyen,Kaushal Dholakia,Daniel Wang

doi:10.1016/j.xnsj.2024.100333

Abstract

BackgroundChatGPT is an advanced language AI able to generate responses to clinical questions regarding lumbar disc herniation with radiculopathy. Artificial intelligence (AI) tools are increasingly being considered to assist clinicians in decision-making. This study compared ChatGPT-3.5 and ChatGPT-4.0 responses to established NASS clinical guidelines and evaluated concordance. MethodsChatGPT-3.5 and ChatGPT-4.0 were prompted with fifteen questions from The 2012 NASS Clinical Guidelines for the Diagnosis and Treatment of Lumbar Disc Herniation with Radiculopathy. Clinical questions organized into categories were directly entered as unmodified queries into ChatGPT. Language output was assessed by two independent authors on September 26, 2023 based on operationally-defined parameters of accuracy, over-conclusiveness, supplementary, and incompleteness. ChatGPT-3.5 and ChatGPT-4.0 performance was compared via chi-square analyses. ResultsAmong the fifteen responses produced by ChatGPT-3.5, seven (47%) were accurate, seven (47%) were over-conclusive, fifteen (100%) were supplementary, and six (40%) were incomplete. For ChatGPT-4.0, ten (67%) were accurate, five (33%) were over-conclusive, ten (67%) were supplementary, and six (40%) were incomplete. There was a statistically significant difference in supplementary information (100% vs. 67%; p=0.014) between ChatGPT-3.5 and ChatGPT-4.0. Accuracy (47% vs. 67%; p=0.269), over-conclusiveness (47% vs. 33%; p=0.456), and incompleteness (40% vs. 40%; p=1.000) did not show significant differences between ChatGPT-3.5 and ChatGPT-4.0. ChatGPT-3.5 and ChatGPT-4.0 both yielded 100% accuracy for definition and history and physical examination categories. Diagnostic testing yielded 0% accuracy for ChatGPT-3.5 and 100% accuracy for ChatGPT-4.0. Non-surgical interventions had 50% accuracy for ChatGPT-3.5 and 63% accuracy for ChatGPT-4.0. Surgical interventions resulted in 0% accuracy for ChatGPT-3.5 and 33% accuracy for ChatGPT-4.0. ConclusionsChatGPT-4.0 provided less supplementary information and overall higher accuracy in question categories than ChatGPT-3.5. ChatGPT showed reasonable concordance to NASS guidelines, but clinicians should caution use of ChatGPT in its current state as it fails to safeguard against misinformation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Lumbar disc herniation with radiculopathy: a comparison of NASS guidelines and ChatGPT

Abstract

Talk to us

Similar Papers

More From: North American Spine Society Journal (NASSJ)

Lead the way for us

Journal: North American Spine Society Journal (NASSJ)	Publication Date: Jun 1, 2024
License type: cc-by-nc-nd

Similar Papers

Endoscopic percutaneous transforaminal treatment for herniated lumbar discs.
S Eustacchio ... M Trummer
Acta neurochirurgica | VOL. 144
S Eustacchio, et. al.S Eustacchio ... M Trummer
01 Oct 2002
Acta neurochirurgica | VOL. 144

ChatGPT Isn't Magic
Tama Leaver ... Suzanne Srdarov
M/C Journal | VOL. 26
Tama Leaver, et. al.Tama Leaver ... Suzanne Srdarov
02 Oct 2023
M/C Journal | VOL. 26

Epidural Steroid Injection for Lumbar Disc Herniation in NFL Athletes
Aaron J Krych ... Frank Cammisa
Medicine & Science in Sports & Exercise | VOL. 44
Aaron J Krych, et. al.Aaron J Krych ... Frank Cammisa
01 Feb 2012
Medicine & Science in Sports & Exercise | VOL. 44

The Influence of Obesity on the Outcome of Treatment of Lumbar Disc Herniation
Jeffrey A Rihn ... Mark Kurd
Journal of Bone and Joint Surgery | VOL. 95
Jeffrey A Rihn, et. al.Jeffrey A Rihn ... Mark Kurd
02 Jan 2013
Journal of Bone and Joint Surgery | VOL. 95

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Lumbar disc herniation with radiculopathy: a comparison of NASS guidelines and ChatGPT

Abstract

Talk to us

Similar Papers

More From: North American Spine Society Journal (NASSJ)