Action selection performance of a reconfigurable basal ganglia inspired model with Hebbian–Bayesian Go-NoGo connectivity

Pierre Berthet,Jeanette Hellgren-Kotaleski,Anders Lansner

doi:10.3389/fnbeh.2012.00065

Pierre Berthet, Jeanette Hellgren-Kotaleski + Show 1 more

Open Access

https://doi.org/10.3389/fnbeh.2012.00065

Copy DOI

Abstract

Several studies have shown a strong involvement of the basal ganglia (BG) in action selection and dopamine dependent learning. The dopaminergic signal to striatum, the input stage of the BG, has been commonly described as coding a reward prediction error (RPE), i.e., the difference between the predicted and actual reward. The RPE has been hypothesized to be critical in the modulation of the synaptic plasticity in cortico-striatal synapses in the direct and indirect pathway. We developed an abstract computational model of the BG, with a dual pathway structure functionally corresponding to the direct and indirect pathways, and compared its behavior to biological data as well as other reinforcement learning models. The computations in our model are inspired by Bayesian inference, and the synaptic plasticity changes depend on a three factor Hebbian–Bayesian learning rule based on co-activation of pre- and post-synaptic units and on the value of the RPE. The model builds on a modified Actor-Critic architecture and implements the direct (Go) and the indirect (NoGo) pathway, as well as the reward prediction (RP) system, acting in a complementary fashion. We investigated the performance of the model system when different configurations of the Go, NoGo, and RP system were utilized, e.g., using only the Go, NoGo, or RP system, or combinations of those. Learning performance was investigated in several types of learning paradigms, such as learning-relearning, successive learning, stochastic learning, reversal learning and a two-choice task. The RPE and the activity of the model during learning were similar to monkey electrophysiological and behavioral data. Our results, however, show that there is not a unique best way to configure this BG model to handle well all the learning paradigms tested. We thus suggest that an agent might dynamically configure its action selection mode, possibly depending on task characteristics and also on how much time is available.

Highlights

When facing a situation where multiple behavioral choices are possible, the action selection process becomes critical
The computations in our model are inspired by Bayesian inference, and the synaptic plasticity changes depend on a three factor Hebbian–Bayesian learning rule based on co-activation of pre- and post-synaptic units and on the value of the reward prediction error (RPE)
Dopamine plays a key role in basal ganglia (BG) functions and is involved in the control of the different pathways (Surmeier et al, 2007), in the modulation of plasticity and learning (Reynolds and Wickens, 2002), and in coding the reward prediction error (RPE) (Montague et al, 1996; Schultz et al, 1997; Schultz and Dickinson, 2000; Daw and Doya, 2006)

Summary

Introduction

When facing a situation where multiple behavioral choices are possible, the action selection process becomes critical. A dual pathway architecture within BG has been described in terms of the direct- and indirect pathways They originate from two different pools of GABAergic medium spiny neurons (MSN) expressing dopamine D1 and D2 receptors respectively (see below). Dopamine plays a key role in BG functions and is involved in the control of the different pathways (Surmeier et al, 2007), in the modulation of plasticity and learning (Reynolds and Wickens, 2002), and in coding the reward prediction error (RPE) (Montague et al, 1996; Schultz et al, 1997; Schultz and Dickinson, 2000; Daw and Doya, 2006). This RPE signal, has been used in the temporal difference

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Behavioral Neuroscience	Publication Date: Jan 1, 2012
Citations: 37	License type: cc-by

R Discovery Prime

R Discovery Prime

Action selection performance of a reconfigurable basal ganglia inspired model with Hebbian–Bayesian Go-NoGo connectivity

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Behavioral Neuroscience

Lead the way for us

Similar Papers

Author response: DYT1 dystonia increases risk taking in humans
David Arkadir ... Naomi Lubarr
-
David Arkadir, et. al.David Arkadir ... Naomi Lubarr
26 Apr 2016
26 Apr 2016

Author response: On the normative advantages of dopamine and striatal opponency for learning and choice
Alana Jaskir ... Michael J Frank
-
Alana Jaskir, et. al.Alana Jaskir ... Michael J Frank
14 Feb 2023
14 Feb 2023

增強學習之酬賞預測誤差：精神病、性格及模型探討

-

01 Jan 2012
01 Jan 2012

Enhancing reinforcement learning models by including direct and indirect pathways improves performance on striatal dependent tasks.
Kim T Blackwell ... Kenji Doya
PLOS Computational Biology | VOL. 19
Kim T Blackwell, et. al.Kim T Blackwell ... Kenji Doya
18 Aug 2023
PLOS Computational Biology | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Action selection performance of a reconfigurable basal ganglia inspired model with Hebbian–Bayesian Go-NoGo connectivity

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Behavioral Neuroscience