Abstract
Purpose: To investigate whether ChatGPT 3.5 and 4.0 can serve as a viable tool for creating readable patient education materials (PEMs) for patients with common orthopaedic upper and lower extremity conditions.

Methods: Using ChatGPT versions 3.5 and 4.0, the authors asked the AI program two questions pertaining to patient education for each of 50 common orthopaedic upper extremity and 50 common orthopaedic lower extremity pathologies. Two templated questions were created and used for all conditions. Readability scores were calculated with the Python library TextStat; multiple readability test scores were generated, and a consensus reading level was derived from the results of eight reading tests (a minimal scoring sketch follows this abstract).

Results: ChatGPT 3.5 produced only 2% and 4% of responses at the appropriate reading level for upper and lower extremity conditions, respectively, compared with 54% produced by ChatGPT 4.0 for both upper and lower extremity conditions (both p<0.0001). Following a priming phase, ChatGPT 3.5 did not produce any viable responses for either the upper or lower extremity conditions, compared with 64% for both upper and lower extremity conditions by ChatGPT 4.0 (both p<0.0001). Additionally, ChatGPT 4.0 was more successful than ChatGPT 3.5 in producing viable responses both pre- and post-priming based on all available reading-level metrics (all p<0.001), including the Automated Readability Index, Coleman-Liau Index, Dale-Chall Score, Flesch-Kincaid Grade, Flesch Reading Ease, Gunning Fog Index, Linsear Write Formula, and Simple Measure of Gobbledygook (SMOG) Index.

Conclusions: Our results indicate that, at the time of the study, ChatGPT did not reliably create readable PEMs for common orthopaedic upper and lower extremity conditions.

Clinical Relevance: The findings of this study suggest that ChatGPT, while constantly improving as evidenced by the advances from 3.5 to 4.0, should not be substituted for traditional methods of patient education at this time; in its current state it may be used as a supplemental resource at the discretion of providers.
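The abstract names the TextStat metrics but does not include the scoring code. The sketch below shows, under assumptions, how the eight readability scores and a consensus grade might be computed for a single ChatGPT response with the textstat Python package; the consensus rule shown (textstat's built-in text_standard estimate) and the sample text are illustrative assumptions, not the authors' exact method.

```python
# Sketch: computing the readability metrics named in the abstract with the
# textstat library. The consensus aggregation used by the study is not
# specified here; textstat.text_standard() is shown only as one plausible way
# to combine the individual tests.
import textstat


def readability_report(text: str) -> dict:
    """Return the eight readability scores listed in the abstract."""
    return {
        "automated_readability_index": textstat.automated_readability_index(text),
        "coleman_liau_index": textstat.coleman_liau_index(text),
        "dale_chall_score": textstat.dale_chall_readability_score(text),
        "flesch_kincaid_grade": textstat.flesch_kincaid_grade(text),
        "flesch_reading_ease": textstat.flesch_reading_ease(text),
        "gunning_fog": textstat.gunning_fog(text),
        "linsear_write_formula": textstat.linsear_write_formula(text),
        "smog_index": textstat.smog_index(text),
    }


def consensus_grade(text: str) -> str:
    """One possible consensus reading level (textstat's built-in estimate)."""
    return textstat.text_standard(text, float_output=False)


if __name__ == "__main__":
    # Hypothetical ChatGPT response used only to demonstrate the scoring calls.
    sample = "Carpal tunnel syndrome happens when a nerve in your wrist is squeezed."
    print(readability_report(sample))
    print(consensus_grade(sample))
```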