Abstract

Importance: ChatGPT is an artificial intelligence (AI) chatbot with significant societal implications. Training curricula using AI are being developed in medicine, and the performance of chatbots in ophthalmology has not been characterized.

Objective: To assess the performance of ChatGPT in answering practice questions for board certification in ophthalmology.

Design, Setting, and Participants: This cross-sectional study used a consecutive sample of text-based multiple-choice questions provided by the OphthoQuestions practice question bank for board certification examination preparation. Of 166 available multiple-choice questions, 125 (75%) were text-based. ChatGPT answered these questions between January 9 and 16, 2023, and again on February 17, 2023.

Main Outcomes and Measures: The primary outcome was the number of board certification examination practice questions that ChatGPT answered correctly. Secondary outcomes were the proportion of questions for which ChatGPT provided additional explanations, the mean length of questions and of responses provided by ChatGPT, the performance of ChatGPT in answering questions without multiple-choice options, and changes in performance over time.

Results: In January 2023, ChatGPT correctly answered 58 of 125 questions (46%). Its performance was best in the general medicine category (11/14; 79%) and poorest in retina and vitreous (0%). The proportion of questions for which ChatGPT provided additional explanations was similar between questions answered correctly and incorrectly (difference, 5.82%; 95% CI, -11.0% to 22.0%; χ²₁ = 0.45; P = .51). The mean length of questions was similar between questions answered correctly and incorrectly (difference, 21.4 characters; SE, 36.8; 95% CI, -51.4 to 94.3; t = 0.58; df = 123; P = .22), as was the mean length of responses (difference, -80.0 characters; SE, 65.4; 95% CI, -209.5 to 49.5; t = -1.22; df = 123; P = .22). ChatGPT selected the same multiple-choice response as the most common answer provided by ophthalmology trainees on OphthoQuestions 44% of the time. In February 2023, ChatGPT provided a correct response to 73 of 125 multiple-choice questions (58%) and to 42 of 78 stand-alone questions (54%) posed without multiple-choice options.

Conclusions and Relevance: ChatGPT answered approximately half of the questions in the OphthoQuestions free trial for ophthalmic board certification preparation correctly. Medical professionals and trainees should appreciate the advances of AI in medicine while acknowledging that ChatGPT, as used in this investigation, did not answer enough multiple-choice questions correctly to provide substantial assistance in preparing for board certification at this time.
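The abstract reports a chi-square comparison of explanation rates and independent-samples t tests on question and response lengths. The study's per-question data are not reproduced here, so the following Python sketch uses hypothetical counts and simulated lengths purely to illustrate how comparisons of this kind are typically computed; none of the numbers below come from the study, and this is not the authors' analysis code.

```python
# Illustrative sketch only: hypothetical data standing in for the study's
# per-question records, used to show the general form of the reported tests.
import numpy as np
from scipy import stats

# --- Proportion of answers with additional explanations (correct vs. incorrect) ---
# Hypothetical 2x2 contingency table: rows = correct/incorrect answers,
# columns = explanation given / not given.
table = np.array([[40, 18],   # correct answers (58 total in January 2023)
                  [42, 25]])  # incorrect answers (67 total)
chi2, p_chi2, dof, _ = stats.chi2_contingency(table, correction=False)
print(f"chi-square({dof}) = {chi2:.2f}, P = {p_chi2:.2f}")

# --- Mean question length (characters), correct vs. incorrect ---
# Hypothetical character counts; sample sizes 58 and 67 give df = 123,
# matching the degrees of freedom reported in the abstract.
rng = np.random.default_rng(0)
len_correct = rng.normal(480, 150, size=58)
len_incorrect = rng.normal(460, 150, size=67)
t_stat, p_t = stats.ttest_ind(len_correct, len_incorrect)  # pooled-variance t test
print(f"t(123) = {t_stat:.2f}, P = {p_t:.2f}")
```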
