Abstract
The ability to correctly estimate the probability of one's choices being correct is fundamental to optimally re-evaluate previous choices or to arbitrate between different decision strategies. Experimental evidence nonetheless suggests that this metacognitive process, confidence judgment, is susceptible to numerous biases. Here, we investigate the effect of outcome valence (gains or losses) on confidence while participants learned stimulus-outcome associations by trial and error. In two experiments, participants were more confident in their choices when learning to seek gains than when learning to avoid losses, despite equal difficulty and performance in those two contexts. Computational modelling revealed that this bias is driven by the context value, a dynamically updated estimate of the average expected value of the choice options, which is necessary to explain equal performance in the gain and loss domains. The biasing effect of context value on confidence, revealed here for the first time in a reinforcement-learning context, is therefore domain-general, with likely important functional consequences. We show that one such consequence emerges in volatile environments, where the (in)flexibility of individuals' learning strategies differs when outcomes are framed as gains or losses. Despite apparently similar behavior, profound asymmetries might therefore exist between learning to avoid losses and learning to seek gains.
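To make the proposed mechanism concrete, the following minimal Python sketch (an illustration, not the authors' fitted model) implements a relative-value learner: a context value v tracks the running average outcome, option values are updated relative to v (which equalizes learning across the gain and loss contexts), and v additively biases a confidence readout derived from the choice probability. All parameter values, and the additive form of the bias, are assumptions made for illustration.

import numpy as np

rng = np.random.default_rng(0)

alpha_q = 0.3   # learning rate for option values (assumed)
alpha_v = 0.3   # learning rate for the context value (assumed)
beta = 5.0      # softmax inverse temperature (assumed)
b = 0.2         # strength of the context-value bias on confidence (assumed)

def run_context(outcomes, n_trials=60):
    """Learn a two-option task; 'outcomes' maps a choice to a sampled payoff."""
    q = np.zeros(2)   # option values
    v = 0.0           # context value: running estimate of the average outcome
    for _ in range(n_trials):
        p_choice = 1.0 / (1.0 + np.exp(-beta * (q[1] - q[0])))  # P(choose option 1)
        choice = int(rng.random() < p_choice)
        r = outcomes(choice)
        q[choice] += alpha_q * (r - v - q[choice])  # values learned relative to context
        v += alpha_v * (r - v)                      # context value tracks average outcome
        # confidence readout: choice probability, additively biased by context value
        p_correct = p_choice if choice == 1 else 1.0 - p_choice
        confidence = np.clip(p_correct + b * v, 0.0, 1.0)
    return v, confidence

# gain context: outcomes in {0, +1}; loss context: outcomes in {-1, 0}
gain = lambda c: float(rng.random() < (0.75 if c == 1 else 0.25))
loss = lambda c: float(rng.random() < (0.75 if c == 1 else 0.25)) - 1.0

v_gain, conf_gain = run_context(gain)
v_loss, conf_loss = run_context(loss)
print(f"context value (gain/loss): {v_gain:+.2f} / {v_loss:+.2f}")
print(f"final confidence (gain/loss): {conf_gain:.2f} / {conf_loss:.2f}")

Because the relative update makes the option values, and hence choice probabilities, converge to the same spread in both contexts while v converges to a positive value for gains and a negative value for losses, the sketch reproduces the qualitative pattern reported above: matched performance, higher confidence for gains.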
Highlights
Simple reinforcement learning algorithms efficiently learn by trial-and-error to implement decision policies that maximize the occurrence of rewards and minimize the occurrence of punishments [1]
In order to arbitrate between different decision strategies, as well as to inform future choices, a decision maker needs to estimate the probability of her choices being correct as precisely as possible
We show that individuals are more confident in their choices when learning to seek gains compared to avoiding losses, despite equal difficulty and performance between those two contexts
Summary
Simple reinforcement-learning algorithms efficiently learn by trial and error to implement decision policies that maximize the occurrence of rewards and minimize the occurrence of punishments [1]. Ecological environments are inherently ever-changing, volatile and complex, such that organisms need to flexibly adjust their learning strategies or dynamically select among different ones. These more sophisticated behaviors can be implemented by reinforcement-learning algorithms that compute measures of environmental uncertainty [10,11,12] or strategy reliability [13,14,15]. Despite the recent surge of neural, computational and behavioral models of confidence estimation in decision-making and prediction tasks [17,23,24], how decision-makers estimate confidence in their choices in reinforcement-learning contexts remains poorly investigated.
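As a concrete illustration of how a learning rule can incorporate environmental uncertainty, here is a minimal Python sketch of a Pearce-Hall-style update, in which an "associability" term tracks recent surprise and scales the effective learning rate. It is a generic textbook mechanism offered for intuition, not a reconstruction of the specific models cited above, and all parameter values are assumed.

import numpy as np

rng = np.random.default_rng(1)

eta = 0.3     # how quickly associability tracks recent surprise (assumed)
kappa = 0.5   # base learning-rate scale (assumed)

q = 0.0       # value estimate for a single option
assoc = 1.0   # associability: running estimate of recent |prediction error|

# volatile environment: the reward probability reverses every 40 trials
p_reward = 0.8
for t in range(120):
    if t > 0 and t % 40 == 0:
        p_reward = 1.0 - p_reward                 # reversal
    r = float(rng.random() < p_reward)
    delta = r - q                                 # prediction error
    q += kappa * assoc * delta                    # surprise-scaled update
    assoc = eta * abs(delta) + (1 - eta) * assoc  # Pearce-Hall associability
    if t % 20 == 19:
        print(f"trial {t+1:3d}: q={q:.2f}, associability={assoc:.2f}")

After each reversal, prediction errors grow, associability rises and the agent updates faster; as the environment stabilizes, associability decays and the value estimate settles again.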