Polite AI mitigates user susceptibility to AI hallucinations

Richard Pak,Ericka Rovira,Anne Mclaughlin

doi:10.1080/00140139.2024.2434604

Richard Pak, Ericka Rovira + Show 1 more

https://doi.org/10.1080/00140139.2024.2434604

Copy DOI

Export

Save

Cite

Journal: Ergonomics

Publication Date: Nov 28, 2024

Abstract
Full-Text
Similar Papers

Abstract

Listen

With their increased capability, AI-based chatbots have become increasingly popular tools to help users answer complex queries. However, these chatbots may hallucinate, or generate incorrect but very plausible-sounding information, more frequently than previously thought. Thus, it is crucial to examine strategies to mitigate human susceptibility to hallucinated output. In a between-subjects experiment, participants completed a difficult quiz with assistance from either a polite or neutral-toned AI chatbot, which occasionally provided hallucinated (incorrect) information. Signal detection analysis revealed that participants interacting with polite-AI showed modestly higher sensitivity in detecting hallucinations and a more conservative response bias compared to those interacting with neutral-toned AI. While the observed effect sizes were modest, even small improvements in users’ ability to detect AI hallucinations can have significant consequences, particularly in high-stakes domains or when aggregated across millions of AI interactions.

Full Text