Abstract
Recent experimental results indicate that cells within theprimary visual cortex can learn to predict the time ofrewards associated with visual cues [1]. In this work, dif-ferent visual cues were paired with rewards at specific tem-poral offsets. Before training, neurons in visual cortexwere active only during the duration of the visual cue.After sufficient training neurons developed persistentactivity for a time period correlated with the timing ofreward.Recurrent connections in a neural network can be con-structed to set a desired network time constant that is dif-ferent from the time constants of the constituent neurons.However, it is not known how such a network can learnthe appropriate recurrent weights. A plasticity model thatis able to accomplish this must be sensitive to the timingof reward events that, at least initially, occur seconds afterthe activity in the network returns to its basal level. Inorder to learn the appropriate dynamics, this networkneeds to solve a temporal credit assignment problem. Inour model plasticity is an ongoing process changing therecurrent synaptic weights as a function of their activity; inthe absence of a reward signal this plasticity rapidlydecays. External reward signals allow permanent expres-sion of preceding plasticity events, reinforcing only thosewhich predict the reward. As a result, the network dynam-ics are altered and it develops time constants correlatedwith the timing of different rewards. As in other reinforce-ment learning models the reward signal is inhibited by thenetwork activity to produce a stable activity pattern.We have implemented these ideas in both abstract passiveintegrator networks and in more realistic integrate and firenetworks and obtained results that are qualitatively simi-lar to the experimental results. Further, we examine theimplications of different possible biophysical mecha-nisms and propose experiments to test which specificmechanism are involved.Support: NSF CRCNS grant number 0515285.
Highlights
Sixteenth Annual Computational Neuroscience Meeting: CNS*2007 William R Holmes Meeting abstracts – A single PDF containing all abstracts in this Supplement is available here http://www.biomedcentral.com/content/pdf/1471-2202-8-S2-info.pdf
Recent experimental results indicate that cells within the primary visual cortex can learn to predict the time of rewards associated with visual cues [1]
Different visual cues were paired with rewards at specific temporal offsets
Summary
Sixteenth Annual Computational Neuroscience Meeting: CNS*2007 William R Holmes Meeting abstracts – A single PDF containing all abstracts in this Supplement is available here http://www.biomedcentral.com/content/pdf/1471-2202-8-S2-info.pdf . Address: 1Department of Neurobiology and Anatomy the University of Texas Medical School in Houston, TX, USA, 2Department of Electrical and Computer Engineering the University of Texas. TX, USA and 3Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA Email: Jeffrey P Gavornik* - gavornik@mail.utexas.edu * Corresponding author from Sixteenth Annual Computational Neuroscience Meeting: CNS*2007 Toronto, Canada.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.