Distinct Functions of the Primate Putamen Direct and Indirect Pathways in Adaptive Outcome-Based Action Selection.

Yasumasa Ueda, Naoyuki Matsumoto, Minoru Kimura, Ko Yamanaka, Kazuki Enomoto, Atsushi Noritake, Kazuyuki Samejima, Hiroshi Yamada, Kae Nakamura, Yukiko Hori, Hitoshi Inokawa

doi:10.3389/fnana.2017.00066

Abstract

Cortico-basal ganglia circuits are critical regulators of reward-based decision making. Reinforcement learning models posit that action reward value is encoded by the firing activity of striatal medium spiny neurons (MSNs) and updated by dopamine (DA) signaling to these neurons when reinforcement contingencies change. However, it remains unclear how the anatomically distinct direct and indirect pathways through the basal ganglia are involved in updating action reward value under changing contingencies. MSNs of the direct pathway predominantly express DA D1 receptors and those of the indirect pathway predominantly express D2 receptors, so we tested for distinct functions in behavioral adaptation by injecting D1 and D2 receptor antagonists into the putamen of two macaque monkeys performing a free-choice task for probabilistic reward. In this task, monkeys turned a handle toward either a left or right target, each of which was assigned a different probability of large reward. Reward probabilities of the left and right targets changed after 30–150 trials, so the monkeys were required to learn the higher-value target choice based on action–outcome history. In the control condition, the monkeys showed stable selection of the higher-value target (the one more likely to yield a large reward) and kept choosing it despite the less frequent small-reward outcomes. The monkeys also made flexible changes of selection away from the higher-value target when two or three small-reward outcomes occurred randomly in succession. DA D1 antagonist injection significantly increased the probability of the monkey switching to the alternate target in response to successive small-reward outcomes. Conversely, D2 antagonist injection significantly decreased the switching probability.
These results suggest distinct functions of D1 and D2 receptor-mediated signaling processes in action selection based on action–outcome history, with D1 receptor-mediated signaling promoting the stable choice of higher-value targets and D2 receptor-mediated signaling promoting a switch in action away from small reward outcomes. Therefore, direct and indirect pathways appear to have complementary functions in maintaining optimal goal-directed action selection and updating action value, which are dependent on D1 and D2 DA receptor signaling.
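The behavioral measure described above, the probability of switching targets after successive small-reward outcomes, can be illustrated with a short sketch. This is not the authors' analysis code: the function name `switch_probability`, the `"L"`/`"R"` coding of choices, and the rule that a run counts when at least the last `run_length` trials were the same target with small reward are all assumptions made for illustration.

```python
from typing import Sequence


def switch_probability(
    choices: Sequence[str],
    large_reward: Sequence[bool],
    run_length: int,
) -> float:
    """Estimate P(switch on the next trial) given that the most recent
    `run_length` trials were all choices of the same target and all
    ended in a small reward.

    Hypothetical helper: the run definition ("at least run_length")
    is an assumption, not taken from the paper.
    """
    if len(choices) != len(large_reward):
        raise ValueError("choices and large_reward must have equal length")
    if run_length < 1:
        raise ValueError("run_length must be at least 1")

    opportunities = 0  # trials preceded by a qualifying small-reward run
    switches = 0       # of those, trials where the other target was chosen

    # t is the last trial of the run; t + 1 is the trial that may switch.
    for t in range(run_length - 1, len(choices) - 1):
        window = range(t - run_length + 1, t + 1)
        same_target = all(choices[i] == choices[t] for i in window)
        all_small = all(not large_reward[i] for i in window)
        if same_target and all_small:
            opportunities += 1
            if choices[t + 1] != choices[t]:
                switches += 1

    # No qualifying runs: the probability is undefined.
    return switches / opportunities if opportunities else float("nan")


# Toy sequence (made up for illustration, not data from the study).
toy_choices = list("LLLLRRRL")
toy_large = [True, False, False, False, True, False, False, True]
p_after_two = switch_probability(toy_choices, toy_large, run_length=2)
p_after_three = switch_probability(toy_choices, toy_large, run_length=3)
```

Under this definition, a D1 antagonist effect of the kind reported would appear as a higher value of this quantity than in control sessions, and a D2 antagonist effect as a lower value.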

Highlights

Humans and non-human animals adapt behavior based on previous experience, choosing actions followed by rewards and avoiding those followed by unfavorable outcomes.
To assess possible contributions of the direct and indirect pathways to action selection based on action–outcome history, we examined the responses of monkeys during a reward-based probabilistic learning paradigm (Samejima et al., 2005) under control conditions, local infusion of a D1 antagonist into the putamen, and local infusion of a D2 antagonist into the putamen.
The large reward probability for the left target remained 50% while the large reward probability for the right target was changed to 90% (L50%–R90% condition), and the monkey quickly switched to the right target on most trials (Figure 1B).

Summary

INTRODUCTION

Humans and non-human animals adapt behavior based on previous experience, choosing actions followed by rewards and avoiding those followed by unfavorable outcomes. To assess possible contributions of the direct and indirect pathways to action selection based on action–outcome history, we examined the responses of monkeys during a reward-based probabilistic learning paradigm (Samejima et al., 2005) under control conditions, local infusion of a D1 antagonist into the putamen, and local infusion of a D2 antagonist into the putamen. In this task, two alternative choices, a lever turn toward a left or a right target, were associated with predetermined probabilities of large and small reward. We found that D1 and D2 receptor-mediated signaling mechanisms regulate the balance between stable and flexible action selection to optimize reward.
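As a rough illustration of how a reinforcement learning agent could behave in a two-target task of this kind, the following sketch pairs a simple delta-rule value update with softmax choice. It is a generic textbook scheme under stated assumptions, not the model used or fitted by the authors: the learning rate `alpha`, inverse temperature `beta`, the coding of large and small reward as 1 and 0, the initial values of 0.5, and all function names are hypothetical.

```python
import math
import random


def softmax_choice_prob(q_left: float, q_right: float, beta: float) -> float:
    """Probability of choosing the left target under a two-option softmax.
    `beta` (inverse temperature) is an assumed free parameter."""
    return 1.0 / (1.0 + math.exp(-beta * (q_left - q_right)))


def update_value(q: float, reward: float, alpha: float) -> float:
    """Delta-rule update: move the action value toward the obtained reward
    by a fraction `alpha` of the reward prediction error."""
    return q + alpha * (reward - q)


def simulate_block(p_large_left, p_large_right, n_trials,
                   alpha=0.2, beta=5.0, seed=0):
    """Simulate one block with fixed large-reward probabilities.

    Assumptions (not from the paper): large reward = 1.0, small reward = 0.0,
    both action values start at 0.5, and only the chosen action is updated.
    """
    rng = random.Random(seed)  # local generator for reproducibility
    q = {"L": 0.5, "R": 0.5}
    choices = []
    for _ in range(n_trials):
        p_left = softmax_choice_prob(q["L"], q["R"], beta)
        choice = "L" if rng.random() < p_left else "R"
        p_large = p_large_left if choice == "L" else p_large_right
        reward = 1.0 if rng.random() < p_large else 0.0
        q[choice] = update_value(q[choice], reward, alpha)
        choices.append(choice)
    return choices, q


# Example block mirroring the L50%-R90% condition mentioned in the text.
block_choices, block_values = simulate_block(0.5, 0.9, n_trials=100)
```

In such a scheme, a larger `alpha` makes the agent abandon a target more readily after a few small-reward outcomes, while a smaller `alpha` favors stable choice; this is only an analogy to the stable versus flexible selection discussed in the text.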

Journal: Frontiers in Neuroanatomy
Publication Date: Aug 3, 2017
Citations: 14
License type: CC BY
