Dissociation between asymmetric value updating and perseverance in human reinforcement learning

Michiyo Sugawara,Kentaro Katahira

doi:10.1038/s41598-020-80593-7

Michiyo Sugawara, Kentaro Katahira

Open Access

https://doi.org/10.1038/s41598-020-80593-7

Copy DOI

Abstract

The learning rate is a key parameter in reinforcement learning that determines the extent to which novel information (outcome) is incorporated in guiding subsequent actions. Numerous studies have reported that the magnitude of the learning rate in human reinforcement learning is biased depending on the sign of the reward prediction error. However, this asymmetry can be observed as a statistical bias if the fitted model ignores the choice autocorrelation (perseverance), which is independent of the outcomes. Therefore, to investigate the genuine process underlying human choice behavior using empirical data, one should dissociate asymmetry in learning and perseverance from choice behavior. The present study addresses this issue by using a Hybrid model incorporating asymmetric learning rates and perseverance. First, by conducting simulations, we demonstrate that the Hybrid model can identify the true underlying process. Second, using the Hybrid model, we show that empirical data collected from a web-based experiment are governed by perseverance rather than asymmetric learning. Finally, we apply the Hybrid model to two open datasets in which asymmetric learning was reported. As a result, the asymmetric learning rate was validated in one dataset but not another.

Highlights

The learning rate is a key parameter in reinforcement learning that determines the extent to which novel information is incorporated in guiding subsequent actions
We investigated the identifiability of the three models (i.e., Asymmetry, Perseverance, and Hybrid) in each learning context, whether pseudo-asymmetric learning rates and pseudo-perseverance occurred by fitting mismatched models, and whether the Hybrid model could distinguish asymmetric value updating from choice perseveration
This study considered a method to dissociate two factors underlying human choice behavior, i.e., asymmetric learning and choice perseverance

Summary

Introduction

The learning rate is a key parameter in reinforcement learning that determines the extent to which novel information (outcome) is incorporated in guiding subsequent actions. Numerous studies have reported that the magnitude of the learning rate in human reinforcement learning is biased depending on the sign of the reward prediction error This asymmetry can be observed as a statistical bias if the fitted model ignores the choice autocorrelation (perseverance), which is independent of the outcomes. Several modeling studies investigating human choice behavior have reported that the magnitude of the value update is biased depending on the sign of the reward prediction error. This bias can be represented in RL models as asymmetric learning rates for positive and negative outcomes. The identification of computational processes, such as asymmetric value updating and perseverance, is crucial for interpreting neural mechanisms and investigating the association with personality traits in the fields of neuroscience, psychology, and p sychiatry

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Feb 11, 2021
Citations: 25	License type: open-access

R Discovery Prime

R Discovery Prime

Dissociation between asymmetric value updating and perseverance in human reinforcement learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Author response: DYT1 dystonia increases risk taking in humans
David Arkadir ... Pietro Mazzoni
-
David Arkadir, et. al.David Arkadir ... Pietro Mazzoni
26 Apr 2016
26 Apr 2016

Effect of lysergic acid diethylamide (LSD) on reinforcement learning in humans.
Jonathan W Kanen ... Rudolf N Cardinal
Psychological Medicine | VOL. 53
Jonathan W Kanen, et. al.Jonathan W Kanen ... Rudolf N Cardinal
22 Nov 2022
Psychological Medicine | VOL. 53

Dynamic Flexibility in Striatal-Cortical Circuits Supports Reinforcement Learning.
Raphael T Gerraty ... Adriana Galvan
The Journal of Neuroscience | VOL. 38
Raphael T Gerraty, et. al.Raphael T Gerraty ... Adriana Galvan
05 Feb 2018
The Journal of Neuroscience | VOL. 38

Changes in corticostriatal connectivity during reinforcement learning in humans.
Guillermo Horga ... Gregory Z Tau
Human Brain Mapping | VOL. 36
Guillermo Horga, et. al.Guillermo Horga ... Gregory Z Tau
12 Nov 2014
Human Brain Mapping | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dissociation between asymmetric value updating and perseverance in human reinforcement learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports