Evaluating vision-capable chatbots in interpreting kinematics graphs: a comparative study of free and subscription-based models

Giulia Polverini,Bor Gregorcic

doi:10.3389/feduc.2024.1452414

Abstract

This study investigates the performance of eight large multimodal model (LMM)-based chatbots on the Test of Understanding Graphs in Kinematics (TUG-K), a research-based concept inventory. Graphs are a widely used representation in STEM and medical fields, making them a relevant topic for exploring LMM-based chatbots’ visual interpretation abilities. We evaluated both freely available chatbots (Gemini 1.0 Pro, Claude 3 Sonnet, Microsoft Copilot, and ChatGPT-4o) and subscription-based ones (Gemini 1.0 Ultra, Gemini 1.5 Pro API, Claude 3 Opus, and ChatGPT-4). We found that OpenAI’s chatbots outperform all the others, with ChatGPT-4o showing the overall best performance. Contrary to expectations, we found no notable differences in the overall performance between freely available and subscription-based versions of Gemini and Claude 3 chatbots, with the exception of Gemini 1.5 Pro, available via API. In addition, we found that tasks relying more heavily on linguistic input were generally easier for chatbots than those requiring visual interpretation. The study provides a basis for considerations of LMM-based chatbot applications in STEM and medical education, and suggests directions for future research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluating vision-capable chatbots in interpreting kinematics graphs: a comparative study of free and subscription-based models

Abstract

Talk to us

Similar Papers

More From: Frontiers in Education

Lead the way for us

Journal: Frontiers in Education	Publication Date: Oct 23, 2024
License type: CC BY 4.0

Similar Papers

VR and AR Applications in Medical Practice and Education
Min-Chai Hsieh ... Yu-Hsuan Lin
Hu li za zhi The journal of nursing | VOL. 64
Min-Chai Hsieh, et. al.Min-Chai Hsieh ... Yu-Hsuan Lin
01 Dec 2017
Hu li za zhi The journal of nursing | VOL. 64

Preliminary Study of VR and AR Applications in Medical and Healthcare Education
Min Chai Hsieh ... Jia Jin Lee
Journal of Nursing and Health Studies | VOL. 03
Min Chai Hsieh, et. al.Min Chai Hsieh ... Jia Jin Lee
01 Jan 2018
Journal of Nursing and Health Studies | VOL. 03

Machine learning's impact on medical education and research: beneficial or detrimental?
Suresh Kanna Subramaniam ... Muthu Prasanna Pichandy
International Journal of Public Health Science (IJPHS) | VOL. 13
Suresh Kanna Subramaniam, et. al.Suresh Kanna Subramaniam ... Muthu Prasanna Pichandy
01 Sep 2024
International Journal of Public Health Science (IJPHS) | VOL. 13

Comparative Study of Machine Learning Models for Power System Fault Identification and Localization
Rachna Vaish ... U.D Dwivedi
-
Rachna Vaish, et. al.Rachna Vaish ... U.D Dwivedi
11 Feb 2022
11 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluating vision-capable chatbots in interpreting kinematics graphs: a comparative study of free and subscription-based models

Abstract

Talk to us

Similar Papers

More From: Frontiers in Education