Abstract
One of the long-term goals of reinforcement learning is to build intelligent agents capable of rapidly learning and flexibly transferring skills, as humans and animals do. In this paper, we introduce an episodic control framework based on the temporal extension of successor features to achieve these goals, which we refer to as Temporally Extended Successor Feature Neural Episodic Control (TESFNEC). The method substantially improves sample efficiency and allows previously learned policies to be reused. Crucially, the model augments agent training with an episodic memory, significantly reducing the number of iterations required to learn an optimal policy. Furthermore, we adopt the temporal extension of successor features as a technique to capture the expected state-transition dynamics of actions. This form of temporal abstraction does not entail learning a top-down hierarchy of task structures; instead, it focuses on the bottom-up combination of actions and action repetitions. Our approach therefore directly accounts for the temporal scope of sequences of temporally extended actions without requiring predefined or domain-specific options. Experimental results in a two-dimensional object-collection environment demonstrate that the proposed method learns policies faster than baseline reinforcement learning approaches and achieves higher average returns.
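Since the paper's implementation is not reproduced here, the following is a minimal Python sketch of the two ingredients the abstract names: an episodic memory in the style of Neural Episodic Control (a nearest-neighbour store keyed on state embeddings), and successor features whose value is a dot product with a reward-weight vector, updated over temporally extended (repeated) actions. All names (EpisodicMemory, q_value, td_update_extended) and the inverse-distance read-out are illustrative assumptions, not the authors' code.

# Hypothetical sketch of the components named in the abstract; the concrete
# architecture, embedding function, and update rules in the paper may differ.
import numpy as np

class EpisodicMemory:
    """NEC-style k-nearest-neighbour store mapping state embeddings to returns."""
    def __init__(self, k=5):
        self.keys, self.values, self.k = [], [], k

    def write(self, key, value):
        # Store an (embedding, observed return) pair at episode end.
        self.keys.append(np.asarray(key, dtype=float))
        self.values.append(float(value))

    def read(self, key):
        # Inverse-distance-weighted average over the k nearest stored keys.
        if not self.keys:
            return 0.0
        dists = np.linalg.norm(np.stack(self.keys) - np.asarray(key), axis=1)
        nearest = np.argsort(dists)[: self.k]
        weights = 1.0 / (dists[nearest] + 1e-3)
        return float(np.dot(weights, np.take(self.values, nearest)) / weights.sum())

def q_value(psi, w):
    """Successor-feature value estimate: Q(s, a) = psi(s, a) . w."""
    return float(np.dot(psi, w))

def td_update_extended(psi, phi_sum, psi_next, gamma, j, lr=0.1):
    """TD update for an action repeated j consecutive steps (assumed form):
    the target is the discounted sum of the state features phi accumulated
    over the j steps, plus gamma**j times the successor features of the
    state reached afterwards."""
    target = phi_sum + (gamma ** j) * psi_next
    return psi + lr * (target - psi)

In such a scheme, the agent would write Monte Carlo returns into the memory at episode end and read from it when estimating values, which is how episodic control can cut the number of iterations needed to reach a good policy; decoupling the features psi from the reward weights w is what permits reuse of learned dynamics across tasks.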