Reinforcement Learning of Targeted Movement in a Spiking Neuronal Model of Motor Cortex

George L Chadderdon, Samuel A Neymotin, Cliff C Kerr, William W Lytton

doi:10.1371/journal.pone.0047251

Abstract

Sensorimotor control has traditionally been considered from a control theory perspective, without relation to neurobiology. In contrast, here we utilized a spiking-neuron model of motor cortex and trained it to perform a simple movement task, which consisted of rotating a single-joint “forearm” to a target. Learning was based on a reinforcement mechanism analogous to that of the dopamine system. This provided a global reward or punishment signal in response to decreasing or increasing distance from hand to target, respectively. Output was partially driven by Poisson motor babbling, creating stochastic movements that could then be shaped by learning. The virtual forearm consisted of a single segment rotated around an elbow joint, controlled by flexor and extensor muscles. The model consisted of 144 excitatory and 64 inhibitory event-based neurons, each with AMPA, NMDA, and GABA synapses. Proprioceptive cell input to this model encoded the 2 muscle lengths. Plasticity was only enabled in feedforward connections between input and output excitatory units, using spike-timing-dependent eligibility traces for synaptic credit or blame assignment. Learning resulted from a global 3-valued signal: reward (+1), no learning (0), or punishment (−1), corresponding to phasic increases, lack of change, or phasic decreases of dopaminergic cell firing, respectively. Successful learning only occurred when both reward and punishment were enabled. In this case, 5 target angles were learned successfully within 180 s of simulation time, with a median error of 8 degrees. Motor babbling allowed exploratory learning, but decreased the stability of the learned behavior, since the hand continued moving after reaching the target. Our model demonstrated that a global reinforcement signal, coupled with eligibility traces for synaptic plasticity, can train a spiking sensorimotor network to perform goal-directed motor behavior.
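The learning rule described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class name `PlasticSynapse`, the function `reinforcement_signal`, and all parameter values (trace time constant, learning rate, pairing window, weight bounds) are hypothetical. It also assumes, for simplicity, that only pre-before-post spike pairings tag a synapse as eligible.

```python
import math


def reinforcement_signal(prev_error_deg, curr_error_deg):
    """Return the global 3-valued signal from the change in hand-to-target error.

    +1 (reward) if the error shrank, -1 (punishment) if it grew, 0 otherwise.
    """
    if curr_error_deg < prev_error_deg:
        return 1
    if curr_error_deg > prev_error_deg:
        return -1
    return 0


class PlasticSynapse:
    """One plastic feedforward synapse with an eligibility trace (assumed form)."""

    def __init__(self, weight, tau_ms=100.0, rate=0.05, w_max=5.0):
        self.weight = weight
        self.tau_ms = tau_ms      # assumed decay time constant of the trace
        self.rate = rate          # assumed learning rate
        self.w_max = w_max        # assumed upper bound on the weight
        self.trace = 0.0
        self.last_pre_ms = None

    def decay(self, dt_ms):
        # The eligibility trace fades exponentially between events.
        self.trace *= math.exp(-dt_ms / self.tau_ms)

    def on_pre_spike(self, t_ms):
        self.last_pre_ms = t_ms

    def on_post_spike(self, t_ms, window_ms=20.0):
        # A presynaptic spike shortly before a postsynaptic spike marks
        # the synapse as eligible for credit or blame.
        if self.last_pre_ms is not None and 0.0 <= t_ms - self.last_pre_ms <= window_ms:
            self.trace = 1.0

    def apply_reinforcement(self, signal):
        # The global signal scales the change; ineligible synapses do not move.
        self.weight += self.rate * signal * self.trace
        self.weight = min(max(self.weight, 0.0), self.w_max)


syn = PlasticSynapse(weight=1.0)
syn.on_pre_spike(0.0)
syn.on_post_spike(5.0)
syn.apply_reinforcement(reinforcement_signal(30.0, 20.0))  # error shrank: reward
```

In this sketch the global signal carries no information about which synapse caused the movement; the eligibility trace alone decides which weights change, which is the credit-assignment idea the abstract describes.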

Highlights

Sensorimotor mappings, for example between proprioceptive input and motor output, are the basis for directed behavior, including foraging, locomotion, and object manipulation
Neuron model: individual neurons were modeled as event-driven, rule-based dynamical units with many of the key features found in real neurons, including adaptation, bursting, depolarization blockade, and voltage-sensitive NMDA conductance [35,36,37,38,39,40]
Relative refractory period was simulated after an action potential by increasing the firing threshold (Vth, the level Vm must cross to spike) by WRR(Vblock − Vth), where WRR was a unitless weight parameter
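As a small worked example of the threshold rule in the last highlight, the function below raises the threshold by WRR(Vblock − Vth) after a spike. The function name and the numerical values are hypothetical, chosen only for illustration, and Vblock is assumed here to be the depolarization-blockade voltage.

```python
def raise_threshold_after_spike(v_th, v_block, w_rr):
    """Return the firing threshold after a spike: Vth + WRR * (Vblock - Vth)."""
    return v_th + w_rr * (v_block - v_th)


# Hypothetical values in mV; w_rr is unitless.
new_v_th = raise_threshold_after_spike(v_th=-40.0, v_block=-10.0, w_rr=0.25)
print(new_v_th)  # -32.5
```

With WRR between 0 and 1, the threshold moves a fraction of the way toward Vblock, so the cell needs more depolarization to fire again right after a spike.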

Summary

Introduction

Sensorimotor mappings, for example between proprioceptive input and motor output, are the basis for directed behavior, including foraging, locomotion, and object manipulation. We simulated a potential mechanism for the learning of sensorimotor mappings, using a biologically-inspired computational model consisting of spiking neuronal units whose synaptic weights are trained via global reward and punisher signals.

Results

Conclusion


Journal: PLoS ONE	Publication Date: Oct 19, 2012
Citations: 83	License type: CC BY 4.0


