A bio-inspired reinforcement learning model that accounts for fast adaptation after punishment

Eric Chalmers, Artur Luczak

doi:10.1016/j.nlm.2024.107974

Abstract

Humans and animals can quickly learn a new strategy when a previously rewarding strategy is punished. It is difficult to model this with reinforcement learning methods, because they tend to perseverate on previously learned strategies, a hallmark of impaired response to punishment. Past work has addressed this by augmenting conventional reinforcement learning equations with ad hoc parameters or parallel learning systems. This produces reinforcement learning models that account for reversal learning, but that are more abstract, more complex, and somewhat detached from neural substrates. Here we use a different approach: we generalize a recently discovered neuron-level learning rule, on the assumption that it captures a basic principle of learning that may also operate at the whole-brain level. Surprisingly, this gives a new reinforcement learning rule that accounts for adaptation and lose-shift behavior, and uses only the same parameters as conventional reinforcement learning equations. In the new rule, the normal reward prediction errors that drive reinforcement learning are scaled by the likelihood the agent assigns to the action that triggered a reward or punishment. The new rule demonstrates quick adaptation in card sorting and variable Iowa gambling tasks, and also exhibits a human-like paradox-of-choice effect. It will be useful for experimental researchers modeling learning and behavior.
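The abstract describes the rule only in words, so the following is a minimal sketch of one possible reading, not the authors' published equation. It assumes a tabular agent with one value per action, a softmax policy, and a delta-rule update in which the prediction error is multiplied by the probability the policy assigned to the chosen action. The function names, the softmax choice, and the parameter names `alpha` and `temperature` are all illustrative assumptions.

```python
import math


def softmax(values, temperature=1.0):
    """Return softmax action probabilities for a list of action values."""
    m = max(values)  # subtract the max for numerical stability
    exps = [math.exp((v - m) / temperature) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def conventional_update(values, action, reward, alpha=0.1):
    """Standard delta rule: move the chosen action's value toward the reward."""
    new_values = list(values)
    new_values[action] += alpha * (reward - values[action])
    return new_values


def likelihood_scaled_update(values, action, reward, alpha=0.1, temperature=1.0):
    """Assumed variant: the prediction error is scaled by the probability
    the (softmax) policy assigned to the action that was taken."""
    probs = softmax(values, temperature)
    new_values = list(values)
    new_values[action] += alpha * probs[action] * (reward - values[action])
    return new_values


if __name__ == "__main__":
    values = [1.0, 0.0]  # action 0 was previously rewarding
    print("policy:", softmax(values))
    # Punish the favored action, then the rarely chosen one, and compare.
    print("punish action 0:", likelihood_scaled_update(values, 0, -1.0))
    print("punish action 1:", likelihood_scaled_update(values, 1, -1.0))
    print("conventional   :", conventional_update(values, 0, -1.0))
```

Under these assumptions, a punishment that follows a confidently chosen action changes its value more than the same punishment following an exploratory action, and no parameters are needed beyond the learning rate and temperature already present in the conventional rule. Whether this sketch reproduces the adaptation results reported in the paper is not something the abstract allows one to check.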


More From: Neurobiology of Learning and Memory


Similar Papers

Reinforcement Learning for Clinical Applications.
Kia Khezeli ... Benjamin Shickel
Clinical journal of the American Society of Nephrology : CJASN | VOL. 18
08 Feb 2023

Author response: Associability-modulated loss learning is increased in posttraumatic stress disorder
Vanessa M Brown ... John M Wang
19 Oct 2017

Metaverse Simulation Based on VR, Blockchain, and Reinforcement Learning Model
Aryan Bagade ... Prof Rupesh Jaiswal Chandrakant
International Journal for Research in Applied Science and Engineering Technology | VOL. 10
31 Oct 2022

Process Simulation and Optimization of Hydrogen Liquification Using Reinforcement Learning
C. Santiago ... M. Zirrahi
22 Apr 2024
