Evaluation of validity and reliability of AI Chatbots as public sources of information on dental trauma.

Ashish J Johnson,Tarun Kumar Singh,Aakash Gupta,Hariram Sankar,Ikroop Gill,Madhav Shalini,Neeraj Mohan

doi:10.1111/edt.13000

Abstract

This study aimed to assess the validity and reliability of AI chatbots, including Bing, ChatGPT 3.5, Google Gemini, and Claude AI, in addressing frequently asked questions (FAQs) related to dental trauma. A set of 30 FAQs was initially formulated by collecting responses from four AI chatbots. A panel comprising expert endodontists and maxillofacial surgeons then refined these to a final selection of 20 questions. Each question was entered into each chatbot three times, generating a total of 240 responses. These responses were evaluated using the Global Quality Score (GQS) on a 5-point Likert scale (5: strongly agree; 4: agree; 3: neutral; 2: disagree; 1: strongly disagree). Any disagreements in scoring were resolved through evidence-based discussions. The validity of the responses was determined by categorizing them as valid or invalid based on two thresholds: a low threshold (scores of ≥ 4 for all three responses) and a high threshold (scores of 5 for all three responses). A chi-squared test was used to compare the validity of the responses between the chatbots. Cronbach's alpha was calculated to assess the reliability by evaluating the consistency of repeated responses from each chatbot. The results indicate that the Claude AI chatbot demonstrated superior validity and reliability compared to ChatGPT and Google Gemini, whereas Bing was found to be less reliable. These findings underscore the need for authorities to establish strict guidelines to ensure the accuracy of medical information provided by AI chatbots.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Evaluation of validity and reliability of AI Chatbots as public sources of information on dental trauma.

Abstract

Talk to us

Similar Papers

More From: Dental traumatology : official publication of International Association for Dental Traumatology

Lead the way for us

Similar Papers

Validity and reliability of artificial intelligence chatbots as public sources of information on endodontics.
Hossein Mohammad-Rahimi ... Mohamad Amin Pourhoseingholi
International Endodontic Journal | VOL. 57
Hossein Mohammad-Rahimi, et. al.Hossein Mohammad-Rahimi ... Mohamad Amin Pourhoseingholi
20 Dec 2023
International Endodontic Journal | VOL. 57

What Role Does AI Chatbot Perform in the F&B Industry? Perspective from Loyalty and Value Co-Creation: Integrated PLS-SEM and ANN Techniques
Binh Hai Thi Nguyen ... Luan Thanh Nguyen
Journal of Law and Sustainable Development | VOL. 11
Binh Hai Thi Nguyen, et. al.Binh Hai Thi Nguyen ... Luan Thanh Nguyen
24 Aug 2023
Journal of Law and Sustainable Development | VOL. 11

Investigating Factors Impacting Customer Acceptance of Artificial Intelligence Chatbot: Banking Sector of Kuwait
Wael Abdallah ... Osama Mosusa
International Journal of Applied Research in Management and Economics | VOL. 5
Wael Abdallah, et. al.Wael Abdallah ... Osama Mosusa
07 Jan 2023
International Journal of Applied Research in Management and Economics | VOL. 5

Empowering student self‐regulated learning and science education through ChatGPT: A pioneering pilot study
Davy Tsz Kit Ng ... Chee Wei Tan
British Journal of Educational Technology | VOL. 55
Davy Tsz Kit Ng, et. al.Davy Tsz Kit Ng ... Chee Wei Tan
22 Mar 2024
British Journal of Educational Technology | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of validity and reliability of AI Chatbots as public sources of information on dental trauma.

Abstract

Talk to us

Similar Papers

More From: Dental traumatology : official publication of International Association for Dental Traumatology