Asymmetric and adaptive reward coding via normalized reinforcement learning.

Kenway Louie

doi:10.1371/journal.pcbi.1010350

Abstract

Learning is widely modeled in psychology, neuroscience, and computer science by prediction error-guided reinforcement learning (RL) algorithms. While standard RL assumes linear reward functions, reward-related neural activity is a saturating, nonlinear function of reward; however, the computational and behavioral implications of nonlinear RL are unknown. Here, we show that nonlinear RL incorporating the canonical divisive normalization computation introduces an intrinsic and tunable asymmetry in prediction error coding. At the behavioral level, this asymmetry explains empirical variability in risk preferences typically attributed to asymmetric learning rates. At the neural level, diversity in asymmetries provides a computational mechanism for recently proposed theories of distributional RL, allowing the brain to learn the full probability distribution of future rewards. This behavioral and computational flexibility argues for an incorporation of biologically valid value functions in computational models of learning and decision-making.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS Computational Biology	Publication Date: Jul 21, 2022
Citations: 11	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Asymmetric and adaptive reward coding via normalized reinforcement learning.

Abstract

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Asymmetric and adaptive reward coding via normalized reinforcement learning
Samuel J Gershman ... Kenway Louie
-
Samuel J Gershman, et. al.Samuel J Gershman ... Kenway Louie
21 Jul 2022
21 Jul 2022

Biped dynamic walking using reinforcement learning
Hamid Benbrahim ... Judy A Franklin
Robotics and Autonomous Systems | VOL. 22
Hamid Benbrahim, et. al.Hamid Benbrahim ... Judy A Franklin
01 Dec 1997
Robotics and Autonomous Systems | VOL. 22

Reinforcement Learning and Dopamine in Schizophrenia: Dimensions of Symptoms or Specific Features of a Disease Group?
Lorenz Deserno ... Florian Schlagenhauf
Frontiers in Psychiatry | VOL. 4
Lorenz Deserno, et. al.Lorenz Deserno ... Florian Schlagenhauf
01 Jan 2013
Frontiers in Psychiatry | VOL. 4

E xploration E xploitation Problem in Policy Based Deep Reinforcement Learning for Episodic and Continuous Environments
Vedang Naik ... Rohit Sahoo
International Journal of Engineering and Advanced Technology | VOL. 11
Vedang Naik, et. al.Vedang Naik ... Rohit Sahoo
30 Dec 2021
International Journal of Engineering and Advanced Technology | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Asymmetric and adaptive reward coding via normalized reinforcement learning.

Abstract

Talk to us

Similar Papers

More From: PLOS Computational Biology