Abstract

This study aimed to evaluate the accuracy of information that patients can obtain from large language models (LLMs) when seeking answers to common questions about choroidal melanoma. In this comparative study, frequently asked questions from choroidal melanoma patients were compiled and submitted to three major LLMs: ChatGPT 3.5, Bing AI, and DocsGPT. Answers were reviewed by three ocular oncology experts and scored as accurate, partially accurate, or inaccurate. Statistical analysis compared the quality of responses across models. For medical advice questions, ChatGPT gave 92% accurate responses compared to 58% for Bing AI and DocsGPT. For pre- and post-operative questions, ChatGPT and Bing AI were 86% accurate while DocsGPT was 73% accurate. There were no statistically significant differences between models. ChatGPT responses were the longest and Bing AI responses the shortest, but length did not affect accuracy. All LLMs appropriately directed patients to seek medical advice from professionals. LLMs show promising capability to address common choroidal melanoma patient questions at generally acceptable accuracy levels. However, inconsistent and inaccurate responses do occur, highlighting the need for improved fine-tuning and oversight before integration into clinical practice.
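To illustrate how such a between-model accuracy comparison might be carried out, the following is a minimal Python sketch. The abstract does not name the statistical test used, so this assumes a chi-square test of independence over categorical accuracy ratings; the response counts below are hypothetical placeholders consistent with the reported percentages, not the study's actual data.

    # Illustrative sketch only: assumes a chi-square test of independence,
    # which the abstract does not specify. Counts are hypothetical.
    from scipy.stats import chi2_contingency

    # Hypothetical counts per model: [accurate, partially accurate, inaccurate]
    counts = {
        "ChatGPT 3.5": [11, 1, 0],   # ~92% accurate
        "Bing AI":     [7, 3, 2],    # ~58% accurate
        "DocsGPT":     [7, 3, 2],    # ~58% accurate
    }

    # Build the models x rating-categories contingency table and test it
    table = [counts[model] for model in counts]
    chi2, p, dof, expected = chi2_contingency(table)
    print(f"chi2 = {chi2:.2f}, dof = {dof}, p = {p:.3f}")

    # A p-value above 0.05 would be consistent with the study's finding
    # of no statistically significant difference between models.

With small samples like these, expected cell counts fall below conventional thresholds for the chi-square approximation, so an exact test (e.g., Fisher's) would be a reasonable alternative in practice.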
