Assessment of Artificial Intelligence Chatbot Responses to Top Searched Queries About Cancer

Alexander Pan, David Musheyev, Daniel Bockelman, Stacy Loeb, Abdo E Kabarriti

doi:10.1001/jamaoncol.2023.2947

Abstract

Consumers are increasingly using artificial intelligence (AI) chatbots as a source of information. However, the quality of the cancer information generated by these chatbots has not yet been evaluated using validated instruments. The objective of this study was to characterize the quality of information and the presence of misinformation about skin, lung, breast, colorectal, and prostate cancers generated by 4 AI chatbots.

This cross-sectional study assessed AI chatbots' text responses to the 5 most commonly searched queries related to each of the 5 most common cancers using validated instruments. Search data were extracted from the publicly available Google Trends platform, and identical prompts were used to generate responses from 4 AI chatbots: ChatGPT version 3.5 (OpenAI), Perplexity (Perplexity.AI), Chatsonic (Writesonic), and Bing AI (Microsoft). Google Trends' top 5 search queries related to skin, lung, breast, colorectal, and prostate cancer from January 1, 2021, to January 1, 2023, were input into the 4 AI chatbots.

The primary outcomes were the quality of consumer health information based on the validated DISCERN instrument (scores from 1 [low] to 5 [high] for quality of information) and the understandability and actionability of this information based on the understandability and actionability domains of the Patient Education Materials Assessment Tool (PEMAT) (scores of 0%-100%, with higher scores indicating a higher level of understandability and actionability). Secondary outcomes included misinformation scored using a 5-item Likert scale (scores from 1 [no misinformation] to 5 [high misinformation]) and readability assessed using the Flesch-Kincaid Grade Level readability score.

The analysis included 100 responses from 4 chatbots about the 5 most common search queries for skin, lung, breast, colorectal, and prostate cancer. The quality of text responses generated by the 4 AI chatbots was good (median [range] DISCERN score, 5 [2-5]), and no misinformation was identified. Understandability was moderate (median [range] PEMAT Understandability score, 66.7% [33.3%-90.1%]), and actionability was poor (median [range] PEMAT Actionability score, 20.0% [0%-40.0%]). The responses were written at the college level based on the Flesch-Kincaid Grade Level score.

The findings of this cross-sectional study suggest that AI chatbots generally produce accurate information for the top cancer-related search queries, but the responses are not readily actionable and are written at a college reading level. These limitations suggest that AI chatbots should be used as a supplement to, and not as a primary source of, medical information.
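The readability and PEMAT measures named in the abstract are simple formulas, so a short sketch may help make them concrete. The Python below computes the standard Flesch-Kincaid Grade Level (0.39 × words per sentence + 11.8 × syllables per word − 15.59) and a PEMAT-style percentage (items rated "agree" divided by applicable items). The function names, the vowel-group syllable heuristic, and the example ratings are illustrative assumptions; the abstract does not say which software or counting rules the authors used, and dedicated readability tools count syllables more carefully.

```python
import re


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption, not a validated counter):
    count groups of consecutive vowels, discounting a silent final 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    # Drop a trailing silent "e" (as in "made") but keep "-le" endings ("table").
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid Grade Level using the standard published coefficients."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * len(words) / len(sentences) + 11.8 * syllables / len(words) - 15.59


def pemat_score(ratings: list) -> float:
    """PEMAT-style percentage: 1 = agree, 0 = disagree, None = not applicable.
    Not-applicable items are excluded from the denominator."""
    applicable = [r for r in ratings if r is not None]
    if not applicable:
        raise ValueError("at least one applicable item is required")
    return 100.0 * sum(applicable) / len(applicable)


if __name__ == "__main__":
    # Hypothetical sample text, not taken from any chatbot response in the study.
    sample = "Screening can find cancer early. Talk to your doctor about your options."
    print(round(flesch_kincaid_grade(sample), 1))
    # Hypothetical ratings: one of five applicable actionability items met.
    print(pemat_score([1, 0, 0, 0, 0]))
```

Under this scheme, a response that meets only one of five applicable actionability items scores 20.0%, which shows how a low actionability percentage can arise even when the information itself is accurate.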

Journal: JAMA Oncology
Publication Date: Aug 24, 2023
Citations: 54

Similar Papers

How Well Do Artificial Intelligence Chatbots Respond to the Top Search Queries About Urological Malignancies?
David Musheyev ... Abdo E Kabarriti
European urology | VOL. 85
10 Aug 2023

Quality of Information About Kidney Stones from Artificial Intelligence Chatbots.
David Musheyev ... James F Borin
Journal of endourology | VOL. 38
29 Jul 2024

Assessment of Artificial Intelligence Chatbot Responses to Common Patient Questions on Bone Sarcoma.
Kameel Khabaz ... Lauren E Wessel
Journal of surgical oncology | VOL. -
29 Oct 2024

Comparative Analysis of AI Chatbots ChatGPT, Gemini, and Copilot’s Answers to Common Cataract Questions
Busra Guner Sonmezoglu ... Halil Ibrahim Sonmezoglu
Pakistan Journal of Ophthalmology | VOL. 40
01 Oct 2024
