The performance of OpenAI ChatGPT-4 and Google Gemini in virology multiple-choice questions: a comparative analysis of English and Arabic responses

Malik Sallam,Kholoud Al-Mahzoum,Rawan Ahmad Almutawaa,Jasmen Ahmad Alhashash,Retaj Abdullah Dashti,Danah Raed Alsafy,Reem Abdullah Almutairi,Muna Barakat

doi:10.1186/s13104-024-06920-7

Abstract

ObjectiveThe integration of artificial intelligence (AI) in healthcare education is inevitable. Understanding the proficiency of generative AI in different languages to answer complex questions is crucial for educational purposes. The study objective was to compare the performance ChatGPT-4 and Gemini in answering Virology multiple-choice questions (MCQs) in English and Arabic, while assessing the quality of the generated content. Both AI models’ responses to 40 Virology MCQs were assessed for correctness and quality based on the CLEAR tool designed for evaluation of AI-generated content. The MCQs were classified into lower and higher cognitive categories based on the revised Bloom’s taxonomy. The study design considered the METRICS checklist for the design and reporting of generative AI-based studies in healthcare.ResultsChatGPT-4 and Gemini performed better in English compared to Arabic, with ChatGPT-4 consistently surpassing Gemini in correctness and CLEAR scores. ChatGPT-4 led Gemini with 80% vs. 62.5% correctness in English compared to 65% vs. 55% in Arabic. For both AI models, superior performance in lower cognitive domains was reported. Both ChatGPT-4 and Gemini exhibited potential in educational applications; nevertheless, their performance varied across languages highlighting the importance of continued development to ensure the effective AI integration in healthcare education globally.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The performance of OpenAI ChatGPT-4 and Google Gemini in virology multiple-choice questions: a comparative analysis of English and Arabic responses

Abstract

Talk to us

Similar Papers

More From: BMC Research Notes

Lead the way for us

Journal: BMC Research Notes	Publication Date: Sep 3, 2024
License type: CC BY-NC-ND 4.0

Similar Papers

Real-World Surveillance of FDA-Cleared Artificial Intelligence Models: Rationale and Logistics.
Keith J Dreyer ... Christoph Wald
Journal of the American College of Radiology | VOL. 19
Keith J Dreyer, et. al.Keith J Dreyer ... Christoph Wald
01 Feb 2022
Journal of the American College of Radiology | VOL. 19

Predictive modeling in reproductive medicine: Where will the future of artificial intelligence research take us?
Carol Lynn Curchoe ... Zev Rosenwaks
Fertility and Sterility | VOL. 114
Carol Lynn Curchoe, et. al.Carol Lynn Curchoe ... Zev Rosenwaks
01 Nov 2020
Fertility and Sterility | VOL. 114

Advancing healthcare: the role and impact of AI and foundation models.
Nandhini Mahesh
American journal of translational research | VOL. 16
Nandhini MaheshNandhini Mahesh
01 Jan 2024
American journal of translational research | VOL. 16

The potential impact of ChatGPT in clinical and translational medicine.
Vivian Weiwen Xue ... Pinggui Lei
Clinical and Translational Medicine | VOL. 13
Vivian Weiwen Xue, et. al.Vivian Weiwen Xue ... Pinggui Lei
01 Mar 2023
Clinical and Translational Medicine | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The performance of OpenAI ChatGPT-4 and Google Gemini in virology multiple-choice questions: a comparative analysis of English and Arabic responses

Abstract

Talk to us

Similar Papers

More From: BMC Research Notes