A nonlinear hidden layer enables actor-critic agents to learn multiple paired association navigation.

M Ganesh Kumar, Camilo Libedinsky, Shih-Cheng Yen, Cheston Tan, Andrew Y Y Tan

doi:10.1093/cercor/bhab456

Abstract

Navigation to multiple cued reward locations has been increasingly used to study rodent learning. Though deep reinforcement learning agents have been shown to be able to learn the task, they are not biologically plausible. Biologically plausible classic actor-critic agents have been shown to learn to navigate to single reward locations, but which biologically plausible agents are able to learn multiple cue-reward location tasks has remained unclear. In this computational study, we show versions of classic agents that learn to navigate to a single reward location, and adapt to reward location displacement, but are not able to learn multiple paired association navigation. The limitation is overcome by an agent in which place cell and cue information are first processed by a feedforward nonlinear hidden layer with synapses to the actor and critic subject to temporal difference error-modulated plasticity. Faster learning is obtained when the feedforward layer is replaced by a recurrent reservoir network.
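The architecture described above can be illustrated with a minimal discrete-time sketch. It is not the authors' implementation: the class name `Agent`, the layer sizes, learning rate, discount factor, Gaussian place-field width, ReLU nonlinearity, fixed random input weights, and the softmax policy-gradient form of the actor update are all assumptions made here for illustration. The sketch only shows the general idea that place cell and cue inputs pass through a nonlinear hidden layer, and that the synapses from that layer to the actor and critic change in proportion to the temporal difference error.

```python
import numpy as np


class Agent:
    """Actor-critic with a nonlinear hidden layer (illustrative sketch only)."""

    def __init__(self, n_cues=4, n_hidden=128, n_actions=8,
                 gamma=0.95, lr=0.01, seed=0):
        rng = np.random.default_rng(seed)
        # Assumed layout: 7x7 grid of place-cell centres over a unit square.
        grid = np.linspace(0.0, 1.0, 7)
        self.centres = np.array([(x, y) for x in grid for y in grid])
        self.n_cues = n_cues
        n_in = len(self.centres) + n_cues
        # Assumption: input weights are random and fixed; only readouts learn.
        self.w_in = rng.normal(0.0, 1.0 / np.sqrt(n_in), (n_hidden, n_in))
        self.w_critic = np.zeros(n_hidden)
        self.w_actor = np.zeros((n_actions, n_hidden))
        self.gamma, self.lr = gamma, lr

    def hidden(self, pos, cue):
        """Gaussian place-cell activity plus one-hot cue, passed through ReLU."""
        sq_dist = np.sum((self.centres - pos) ** 2, axis=1)
        place = np.exp(-sq_dist / (2 * 0.1 ** 2))
        x = np.concatenate([place, np.eye(self.n_cues)[cue]])
        return np.maximum(0.0, self.w_in @ x)

    def policy(self, h):
        """Softmax over actor outputs."""
        logits = self.w_actor @ h
        p = np.exp(logits - logits.max())
        return p / p.sum()

    def td_update(self, h, action, reward, h_next, done):
        """Update critic and actor readouts, both scaled by the TD error."""
        v = self.w_critic @ h
        v_next = 0.0 if done else self.w_critic @ h_next
        delta = reward + self.gamma * v_next - v
        probs = self.policy(h)  # evaluate the policy before changing weights
        self.w_critic += self.lr * delta * h
        grad = -probs
        grad[action] += 1.0  # one-hot(action) minus action probabilities
        self.w_actor += self.lr * delta * np.outer(grad, h)
        return delta
```

A typical call would compute `h = agent.hidden(pos, cue)`, sample an action from `agent.policy(h)`, step the environment, and then call `agent.td_update(...)` with the next hidden state. Because the cue enters through the nonlinear layer, the readouts can in principle represent cue-dependent value and policy maps, which a purely linear readout of concatenated place and cue inputs cannot.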

Journal: Cerebral cortex (New York, N.Y. : 1991)
Publication Date: Jan 17, 2022
Citations: 1

Similar Papers

Different encoding of reward location in dorsal and intermediate hippocampus
Przemyslaw Jarzebowski ... Ole Paulsen
Current Biology | VOL. 32
10 Jan 2022

Conjunctive reward-place coding properties of dorsal distal CA1 hippocampus cells.
Zhuocheng Xiao ... Kevin Lin
Biological Cybernetics | VOL. 114
01 Apr 2020

Author response: Neural learning rules for generating flexible predictions and computing the successor representation
Ching Fang ... Dmitriy Aronov
12 Oct 2022

Editor's evaluation: Neural learning rules for generating flexible predictions and computing the successor representation
Srdjan Ostojic
29 Aug 2022
