Performance of ChatGPT in Israeli Hebrew OBGYN national residency examinations.

Adiel Cohen,Raanan Meyer,Naama Lessans,Gabriel Levin,Roie Alter,Yoav Brezinov

doi:10.1007/s00404-023-07185-4

Abstract

Previous studies of ChatGPT performance in the field of medical examinations have reached contradictory results. Moreover, the performance of ChatGPT in other languages other than English is yet to be explored. We aim to study the performance of ChatGPT in Hebrew OBGYN-'Shlav-Alef' (Phase 1) examination. A performance study was conducted using a consecutive sample of text-based multiple choice questions, originated from authentic Hebrew OBGYN-'Shlav-Alef' examinations in 2021-2022. We constructed 150 multiple choice questions from consecutive text-based-only original questions. We compared the performance of ChatGPT performance to the real-life actual performance of OBGYN residents who completed the tests in 2021-2022. We also compared ChatGTP Hebrew performance vs. previously published English medical tests. In 2021-2022, 27.8% of OBGYN residents failed the 'Shlav-Alef' examination and the mean score of the residents was 68.4. Overall, 150 authentic questions were evaluated (one examination). ChatGPT correctly answered 58 questions (38.7%) and reached a failed score. The performance of Hebrew ChatGPT was lower when compared to actual performance of residents: 38.7% vs. 68.4%, p < .001. In a comparison to ChatGPT performance in 9,091 English language questions in the field of medicine, the performance of Hebrew ChatGPT was lower (38.7% in Hebrew vs. 60.7% in English, p < .001). ChatGPT answered correctly on less than 40% of Hebrew OBGYN resident examination questions. Residents cannot rely on ChatGPT for the preparation of this examination. Efforts should be made to improve ChatGPT performance in other languages besides English.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance of ChatGPT in Israeli Hebrew OBGYN national residency examinations.

Abstract

Talk to us

Similar Papers

More From: Archives of Gynecology and Obstetrics

Lead the way for us

Journal: Archives of Gynecology and Obstetrics	Publication Date: Sep 5, 2023
Citations: 8

Similar Papers

THE EVALUATION OF NATIONAL EXAMINATION OF ENGLISH SUBJECT AT SECONDARY SCHOOL
Indra Yoga Prawiro
Wiralodra English Journal | VOL. 2
Indra Yoga PrawiroIndra Yoga Prawiro
01 Sep 2018
Wiralodra English Journal | VOL. 2

THE EVALUATION OF NATIONAL EXAMINATION OF ENGLISH SUBJECT AT SECONDARY SCHOOL
Indra Yoga Prawiro
Wiralodra English Journal | VOL. 2
Indra Yoga PrawiroIndra Yoga Prawiro
05 Jul 2019
Wiralodra English Journal | VOL. 2

THE OPTIMAL NUMBER OF OPTIONS USED IN MULTIPLE-CHOICE TEST FORMAT FOR NATIONAL EXAMINATIONS IN INDONESIA
Herland Franley Manalu ... Diana Anggraeni
Humanities & Social Sciences Reviews | VOL. 8
Herland Franley Manalu, et. al.Herland Franley Manalu ... Diana Anggraeni
05 May 2020
Humanities & Social Sciences Reviews | VOL. 8

The relationship of national inservice examination scores, emergency medicine faculty evaluations, and level of training of emergency medicine residents
J.G Ryan ... M.F Ward
Annals of Emergency Medicine | VOL. 44
J.G Ryan, et. al.J.G Ryan ... M.F Ward
25 Sep 2004
Annals of Emergency Medicine | VOL. 44

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance of ChatGPT in Israeli Hebrew OBGYN national residency examinations.

Abstract

Talk to us

Similar Papers

More From: Archives of Gynecology and Obstetrics