Evaluation and Comparison of Ophthalmic Scientific Abstracts and References by Current Artificial Intelligence Chatbots

Hong-Uyen Hua,Abdul-Hadi Kaakour,Aleksandra Rachitskaya,Sunil Srivastava,Sumit Sharma,Danny A Mammo

doi:10.1001/jamaophthalmol.2023.3119

Hong-Uyen Hua, Abdul-Hadi Kaakour + Show 4 more

https://doi.org/10.1001/jamaophthalmol.2023.3119

Copy DOI

Abstract

Language-learning model-based artificial intelligence (AI) chatbots are growing in popularity and have significant implications for both patient education and academia. Drawbacks of using AI chatbots in generating scientific abstracts and reference lists, including inaccurate content coming from hallucinations (ie, AI-generated output that deviates from its training data), have not been fully explored. To evaluate and compare the quality of ophthalmic scientific abstracts and references generated by earlier and updated versions of a popular AI chatbot. This cross-sectional comparative study used 2 versions of an AI chatbot to generate scientific abstracts and 10 references for clinical research questions across 7 ophthalmology subspecialties. The abstracts were graded by 2 authors using modified DISCERN criteria and performance evaluation scores. Scores for the chatbot-generated abstracts were compared using the t test. Abstracts were also evaluated by 2 AI output detectors. A hallucination rate for unverifiable references generated by the earlier and updated versions of the chatbot was calculated and compared. The mean modified AI-DISCERN scores for the chatbot-generated abstracts were 35.9 and 38.1 (maximum of 50) for the earlier and updated versions, respectively (P = .30). Using the 2 AI output detectors, the mean fake scores (with a score of 100% meaning generated by AI) for the earlier and updated chatbot-generated abstracts were 65.4% and 10.8%, respectively (P = .01), for one detector and were 69.5% and 42.7% (P = .17) for the second detector. The mean hallucination rates for nonverifiable references generated by the earlier and updated versions were 33% and 29% (P = .74). Both versions of the chatbot generated average-quality abstracts. There was a high hallucination rate of generating fake references, and caution should be used when using these AI resources for health education or academic purposes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluation and Comparison of Ophthalmic Scientific Abstracts and References by Current Artificial Intelligence Chatbots

Abstract

Talk to us

Similar Papers

More From: JAMA ophthalmology

Lead the way for us

Journal: JAMA ophthalmology	Publication Date: Jul 27, 2023
Citations: 24

Similar Papers

Do AI chatbots improve students learning outcomes? Evidence from a meta‐analysis
Rong Wu ... Zhonggen Yu
British Journal of Educational Technology | VOL. 55
Rong Wu, et. al.Rong Wu ... Zhonggen Yu
03 May 2023
British Journal of Educational Technology | VOL. 55

Business types matter: new insights into the effects of anthropomorphic cues in AI chatbots
Kibum Youn ... Moonhee Cho
Journal of Services Marketing | VOL. 37
Kibum Youn, et. al.Kibum Youn ... Moonhee Cho
07 Jun 2023
Journal of Services Marketing | VOL. 37

Artificial Intelligence-Based Chatbots to Combat COVID-19 Pandemic: A Scoping Review
Abdollah Mahdavi ... Roya Naemi
Shiraz E-Medical Journal | VOL. 24
Abdollah Mahdavi, et. al.Abdollah Mahdavi ... Roya Naemi
16 Nov 2023
Shiraz E-Medical Journal | VOL. 24

Effects of the use of a conversational artificial intelligence chatbot on medical students’ patient-centered communication skill development in a metaverse environment
Hyeonmi Hong ... Sunghee Shin
Journal of Medicine and Life Science | VOL. 21
Hyeonmi Hong, et. al.Hyeonmi Hong ... Sunghee Shin
30 Sep 2024
Journal of Medicine and Life Science | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation and Comparison of Ophthalmic Scientific Abstracts and References by Current Artificial Intelligence Chatbots

Abstract

Talk to us

Similar Papers

More From: JAMA ophthalmology