Abstract

Animals remember temporal links between their actions and subsequent rewards. We previously discovered a synaptic mechanism underlying such reward learning in D1 receptor (D1R)-expressing spiny projection neurons (D1 SPN) of the striatum. Dopamine (DA) bursts promote dendritic spine enlargement in a time window of only a few seconds after paired pre- and post-synaptic spiking (pre-post pairing), which is termed as reinforcement plasticity (RP). The previous study has also identified underlying signaling pathways; however, it still remains unclear how the signaling dynamics results in RP. In the present study, we first developed a computational model of signaling dynamics of D1 SPNs. The D1 RP model successfully reproduced experimentally observed protein kinase A (PKA) activity, including its critical time window. In this model, adenylate cyclase type 1 (AC1) in the spines/thin dendrites played a pivotal role as a coincidence detector against pre-post pairing and DA burst. In particular, pre-post pairing (Ca2+ signal) stimulated AC1 with a delay, and the Ca2+-stimulated AC1 was activated by the DA burst for the asymmetric time window. Moreover, the smallness of the spines/thin dendrites is crucial to the short time window for the PKA activity. We then developed a RP model for D2 SPNs, which also predicted the critical time window for RP that depended on the timing of pre-post pairing and phasic DA dip. AC1 worked for the coincidence detector in the D2 RP model as well. We further simulated the signaling pathway leading to Ca2+/calmodulin-dependent protein kinase II (CaMKII) activation and clarified the role of the downstream molecules of AC1 as the integrators that turn transient input signals into persistent spine enlargement. Finally, we discuss how such timing windows guide animals' reward learning.

Highlights

  • Animals behave in their environments to obtain larger rewards [1]

  • How do animals associate their actions with subsequent rewards? We previously found a synaptic basis for this association

  • Synaptic plasticity in the striatum occurs only if the action signal comes just before dopamine (DA) signal, which is known to encode reward/regret information in the brain

Read more

Summary

Introduction

Animals behave in their environments to obtain larger rewards [1]. During such reward learning, a specific brain region—the striatum—plays a central role in associating the situation/ behavioral action with reward signals [2,3]. One explores the direct effect of DA signal on NMDARs and VGCCs [12], and the other models DA-delay dependence in Ca2+ dynamics through a Ca2+ buffer [13]. Those models predict 100-ms and 20-s time windows, respectively, and it still remains obscure about the mechanisms of the in-between time window of ~2 s, which is required for DA-mediated reward conditioning [14]. A pharmacological experiment showed that RP depends on the other type of AC, Ca2+-sensitive type 1 AC (AC1) [4], which is expressed in the striatum [15]

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call