Inductive reasoning in humans and large language models

Simon Jerome Han, Keith J Ransom, Andrew Perfors, Charles Kemp

doi:10.1016/j.cogsys.2023.101155

Open Access

Journal: Cognitive Systems Research
Publication Date: Aug 9, 2023
Citations: 9
License type: cc-by

Affiliation: University of Melbourne

Abstract

The impressive recent performance of large language models has led many to wonder to what extent they can serve as models of general intelligence or are similar to human cognition. We address this issue by applying GPT-3.5 and GPT-4 to a classic problem in human inductive reasoning known as property induction. Over two experiments, we elicit human judgments on a range of property induction tasks spanning multiple domains. Although GPT-3.5 struggles to capture many aspects of human behavior, GPT-4 is much more successful: for the most part, its performance qualitatively matches that of humans, and the only notable exception is its failure to capture the phenomenon of premise non-monotonicity. Our work demonstrates that property induction allows for interesting comparisons between human and machine intelligence and provides two large datasets that can serve as benchmarks for future work in this vein.

Similar Papers

Performance of Large Language Models on a Neurology Board–Style Examination
Marc Cicero Schubert ... Varun Venkataramani
JAMA network open | VOL. 6
07 Dec 2023

Legal aspects of generative artificial intelligence and large language models in examinations and theses.
Maren März ... Alexander Oksche
GMS journal for medical education | VOL. 41
01 Jan 2024

Dermatological Knowledge and Image Analysis Performance of Large Language Models Based on Specialty Certificate Examination in Dermatology
Ka Siu Fan ... Ka Hay Fan
Dermato | VOL. 4
30 Sep 2024

Evaluating the Performance of Large Language Models in Hematopoietic Stem Cell Transplantation Decision Making
Ivan Civettini ... Paola Perfetti
Blood | VOL. 142
02 Nov 2023
