Mid-lateral cerebellar complex spikes encode multiple independent reward-related signals during reinforcement learning

Naveen Sendhilnathan,Michael E Goldberg,Anna Ipata

doi:10.1038/s41467-021-26338-0

Naveen Sendhilnathan, Michael E Goldberg + Show 1 more

Open Access

https://doi.org/10.1038/s41467-021-26338-0

Copy DOI

Abstract

Although the cerebellum has been implicated in simple reward-based learning recently, the role of complex spikes (CS) and simple spikes (SS), their interaction and their relationship to complex reinforcement learning and decision making is still unclear. Here we show that in a context where a non-human primate learned to make novel visuomotor associations, classifying CS responses based on their SS properties revealed distinct cell-type specific encoding of the probability of failure after the stimulus onset and the non-human primate’s decision. In a different context, CS from the same cerebellar area also responded in a cell-type and learning independent manner to the stimulus that signaled the beginning of the trial. Both types of CS signals were independent of changes in any motor kinematics and were unlikely to instruct the concurrent SS activity through an error based mechanism, suggesting the presence of context dependent, flexible, multiple independent channels of neural encoding by CS and SS. This diversity in neural information encoding in the mid-lateral cerebellum, depending on the context and learning state, is well suited to promote exploration and acquisition of wide range of cognitive behaviors that entail flexible stimulus-action-reward relationships but not necessarily motor learning.

Highlights

The cerebellum has been implicated in simple reward-based learning recently, the role of complex spikes (CS) and simple spikes (SS), their interaction and their relationship to complex reinforcement learning and decision making is still unclear
Recent evidence suggest that cerebellar activity is correlated with aspects of behavior that do not involve correcting the kinematics of movement: for example classical conditioning[11], stimulus prediction[12,13], and the magnitude of predicted reward[14,15]
After a fixed duration (800 ms), one of the two symbols briefly appeared on the screen and they released the hand associated with that symbol, as soon as possible, with a welllearned stereotypic hand movement to earn a liquid reward (Fig. 1a)

Summary

Results

Two non-human primates performed a two-alternative forcedchoice discrimination task where, in each session, they associated one of two visual symbols with a left-hand movement and the other visual symbol with a right-hand movement[17]. They grabbed the two bars, each with one hand to initiate the trial. We presented them with two novel symbols that they learned to associate with specific choices (hand releases), through trial and error They typically achieved criterion for learning (see Methods) in ~50–70 trials on an average through an adaptive learning mechanism (Fig. 1b). The mid-lateral cerebellar P-cell SS encode a reinforcement error signal when animals learn a new visuomotor association, by reporting the outcome of the most recent decision in short epochs called “delta epochs” in a manner entirely a fixed interval fixed interval

H Eye movement cue1 cue2 sym movt

H V H 1V beg mid end k wP-cells cP-cells correct

Discussion

Methods

Code availability

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature Communications	Publication Date: Nov 9, 2021
Citations: 18	License type: open-access

R Discovery Prime

R Discovery Prime

Mid-lateral cerebellar complex spikes encode multiple independent reward-related signals during reinforcement learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature Communications

Lead the way for us

Similar Papers

The specific origin of the simple and complex spikes in Purkinje neurons
Eric Avila Orozco ... William Vogt
The Journal of Physiology | VOL. 588
Eric Avila Orozco, et. al.Eric Avila Orozco ... William Vogt
14 Oct 2010
The Journal of Physiology | VOL. 588

Differences in responses to 70 dB clicks of cerebellar units with simple versus complex spike activity: (i) In medial and lateral ansiform lobes and flocculus; and (ii) Before and after conditioning blink conditioned responses with clicks as conditioned stimuli
C.D Woody ... E Gruen
Neuroscience | VOL. 90
C.D Woody, et. al.C.D Woody ... E Gruen
29 Mar 1999
Neuroscience | VOL. 90

RAPID REPORT: Initiation of simple and complex spikes in cerebellar Purkinje cells
Lucy M Palmer ... Jan Gründemann
The Journal of Physiology | VOL. 588
Lucy M Palmer, et. al.Lucy M Palmer ... Jan Gründemann
15 May 2010
The Journal of Physiology | VOL. 588

Spontaneous and sound-evoked discharge characteristics of complex-spiking neurons in the dorsal cochlear nucleus of the unanesthetized decerebrate cat.
K Parham ... D O Kim
Journal of neurophysiology | VOL. 73
K Parham, et. al.K Parham ... D O Kim
01 Feb 1995
Journal of neurophysiology | VOL. 73

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Mid-lateral cerebellar complex spikes encode multiple independent reward-related signals during reinforcement learning

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nature Communications