Abstract
Large language models (LLMs) are finding increasing application across many fields. Here, three publicly available LLM chatbots (ChatGPT-3.5, ChatGPT-4, and Bard) are assessed, in their current form, for their ability to distinguish individuals with Alzheimer's dementia (AD) from Cognitively Normal (CN) individuals using textual input derived from spontaneous speech recordings. A zero-shot learning approach is used at two levels of independent queries, with the second query (chain-of-thought prompting) eliciting more detailed information than the first. Each LLM chatbot's predictions are evaluated in terms of accuracy, sensitivity, specificity, precision, and F1 score. The LLM chatbots generated a three-class outcome ("AD", "CN", or "Unsure"). When positively identifying AD, Bard produced the highest true-positive rate (89% recall) and highest F1 score (71%), but tended to misidentify CN as AD with high confidence (low "Unsure" rates); when positively identifying CN, GPT-4 produced the highest true-negative rate (56%) and highest F1 score (62%), adopting a more diplomatic stance (moderate "Unsure" rates). Overall, the three LLM chatbots can distinguish AD from CN at above-chance levels, but do not currently satisfy the requirements for clinical application.
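The evaluation described above can be sketched as follows. This is a minimal illustration, not the study's code: the sample labels are invented, and the handling of "Unsure" responses (counted as neither a true nor a false prediction, and reported as a separate rate) is one plausible convention, since the abstract does not specify how they entered the metric denominators.

```python
# Illustrative sketch: metrics for a three-class chatbot outcome
# ("AD" / "CN" / "Unsure") scored against two-class ground truth.
# Assumption: "Unsure" predictions count as neither TP/TN nor FP/FN
# and are reported via a separate "unsure_rate".

def evaluate(y_true, y_pred, positive="AD", negative="CN"):
    """Treat AD as the positive class; y_true contains only AD/CN."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t == negative and p == negative for t, p in pairs)
    fp = sum(t == negative and p == positive for t, p in pairs)
    fn = sum(t == positive and p == negative for t, p in pairs)
    n = len(y_true)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # recall on AD
    specificity = tn / (tn + fp) if (tn + fp) else 0.0  # recall on CN
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    f1 = (2 * precision * sensitivity / (precision + sensitivity)
          if (precision + sensitivity) else 0.0)
    return {
        "accuracy": (tp + tn) / n,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "unsure_rate": sum(p == "Unsure" for p in y_pred) / n,
    }

# Invented example labels, for illustration only:
truth = ["AD", "AD", "CN", "CN", "AD", "CN"]
preds = ["AD", "Unsure", "CN", "AD", "AD", "CN"]
print(evaluate(truth, preds))
```

Swapping the roles of `positive` and `negative` gives the CN-as-positive view used for the GPT-4 comparison in the abstract.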