ChatGPT's Response Consistency: A Study on Repeated Queries of Medical Examination Questions.

Paul F Funk,Barbara Wollenberg,Ali Bashiri Dezfouli,Orlando Guntinas-Lichius,Giuseppe Sofo,Sebastian Cotofana,Michael Alfertshofer,Samuel Knoedler,Cosima C Hoch,Leonard Knoedler

doi:10.3390/ejihpe14030043

Paul F Funk, Barbara Wollenberg + Show 8 more

Open Access

https://doi.org/10.3390/ejihpe14030043

Copy DOI

Abstract

(1) Background: As the field of artificial intelligence (AI) evolves, tools like ChatGPT are increasingly integrated into various domains of medicine, including medical education and research. Given the critical nature of medicine, it is of paramount importance that AI tools offer a high degree of reliability in the information they provide. (2) Methods: A total of n = 450 medical examination questions were manually entered into ChatGPT thrice, each for ChatGPT 3.5 and ChatGPT 4. The responses were collected, and their accuracy and consistency were statistically analyzed throughout the series of entries. (3) Results: ChatGPT 4 displayed a statistically significantly improved accuracy with 85.7% compared to that of 57.7% of ChatGPT 3.5 (p < 0.001). Furthermore, ChatGPT 4 was more consistent, correctly answering 77.8% across all rounds, a significant increase from the 44.9% observed from ChatGPT 3.5 (p < 0.001). (4) Conclusions: The findings underscore the increased accuracy and dependability of ChatGPT 4 in the context of medical education and potential clinical decision making. Nonetheless, the research emphasizes the indispensable nature of human-delivered healthcare and the vital role of continuous assessment in leveraging AI in medicine.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: European journal of investigation in health, psychology and education	Publication Date: Mar 8, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

ChatGPT's Response Consistency: A Study on Repeated Queries of Medical Examination Questions.

Abstract

Talk to us

Similar Papers

More From: European journal of investigation in health, psychology and education

Lead the way for us

Similar Papers

Towards AGI: Cognitive Architecture Based on Hybrid and Bionic Principles
R V Dushkin
-
R V DushkinR V Dushkin
13 Jul 2021
13 Jul 2021

Consideration of breakthrough technologies in the field of genomic research and artificial intelligence in healthcare
L.V Chkhutiashvili
Buhuchet v zdravoohranenii (Accounting in Healthcare) | VOL. -
L.V ChkhutiashviliL.V Chkhutiashvili
01 Nov 2021
Buhuchet v zdravoohranenii (Accounting in Healthcare) | VOL. -

Analysis of the World Experience in the Use of Artificial Intelligence to Optimize Business Processes of Enterprises
K I Dementev
Administrative Consulting | VOL. -
K I DementevK I Dementev
24 Feb 2023
Administrative Consulting | VOL. -

Plan and Develop Advanced Knowledge and Skills for Future Industrial Employees in the Field of Artificial Intelligence, Internet of Things and Edge Computing
Daniele Mazzei ... Daniele Atzeni
Sustainability | VOL. 14
Daniele Mazzei, et. al.Daniele Mazzei ... Daniele Atzeni
11 Mar 2022
Sustainability | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ChatGPT's Response Consistency: A Study on Repeated Queries of Medical Examination Questions.

Abstract

Talk to us

Similar Papers

More From: European journal of investigation in health, psychology and education