Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT.

Thilo Hagendorff,Sarah Fabi,Michal Kosinski

doi:10.1038/s43588-023-00527-x

Abstract

We design a battery of semantic illusions and cognitive reflection tests, aimed to elicit intuitive yet erroneous responses. We administer these tasks, traditionally used to study reasoning and decision-making in humans, to OpenAI's generative pre-trained transformer model family. The results show that as the models expand in size and linguistic proficiency they increasingly display human-like intuitive system 1 thinking and associated cognitive errors. This pattern shifts notably with the introduction of ChatGPT models, which tend to respond correctly, avoiding the traps embedded in the tasks. Both ChatGPT-3.5 and 4 utilize the input-output context window to engage in chain-of-thought reasoning, reminiscent of how people use notepads to support their system 2 thinking. Yet, they remain accurate even when prevented from engaging in chain-of-thought reasoning, indicating that their system-1-like next-word generation processes are more accurate than those of older models. Our findings highlight the value of applying psychological methodologies to study large language models, as this can uncover previously undetected emergent characteristics.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature computational science	Publication Date: Oct 5, 2023
Citations: 47	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT.

Abstract

Talk to us

Similar Papers

More From: Nature computational science

Lead the way for us

Similar Papers

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... W Nick Street
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... W Nick Street
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

The Accuracy and Capability of Artificial Intelligence Solutions in Health Care Examinations and Certificates: Systematic Review and Meta-Analysis.
William Joel Waldock ... Ahmad Nabeel
Journal of medical Internet research | VOL. 26
William Joel Waldock, et. al.William Joel Waldock ... Ahmad Nabeel
05 Nov 2024
Journal of medical Internet research | VOL. 26

Evaluating the Performance of Large Language Models in Hematopoietic Stem Cell Transplantation Decision Making
Ivan Civettini ... Carlo Gambacorti-Passerini
Blood | VOL. 142
Ivan Civettini, et. al.Ivan Civettini ... Carlo Gambacorti-Passerini
02 Nov 2023
Blood | VOL. 142

Large Language Models Can Enable Inductive Thematic Analysis of a Social Media Corpus in a Single Prompt: Human Validation Study.
Michael S Deiner ... Urmimala Sarkar
JMIR infodemiology | VOL. 4
Michael S Deiner, et. al.Michael S Deiner ... Urmimala Sarkar
29 Aug 2024
JMIR infodemiology | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT.

Abstract

Talk to us

Similar Papers

More From: Nature computational science