Learning Functionally Decomposed Hierarchies for Continuous Control Tasks With Path Planning

Sammy Christen,Emre Aksan,Lukas Jendele,Otmar Hilliges

doi:10.1109/lra.2021.3060403

Abstract

We present HiDe, a novel hierarchical reinforcement learning architecture that successfully solves long horizon control tasks and generalizes to unseen test scenarios. Functional decomposition between planning and low-level control is achieved by explicitly separating the state-action spaces across the hierarchy, which allows the integration of task-relevant knowledge per layer. We propose an RL-based planner to efficiently leverage the information in the planning layer of the hierarchy, while the control layer learns a goal-conditioned control policy. The hierarchy is trained jointly but allows for the modular transfer of policy layers across hierarchies of different agents. We experimentally show that our method generalizes across unseen test environments and can scale to 3x horizon length compared to both learning and non-learning based methods. We evaluate on complex continuous control tasks with sparse rewards, including navigation and robot manipulation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Robotics and Automation Letters	Publication Date: Apr 1, 2021
Citations: 12	License type: other-oa

R Discovery Prime

R Discovery Prime

Learning Functionally Decomposed Hierarchies for Continuous Control Tasks With Path Planning

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters

Lead the way for us

Similar Papers

Deep Reinforcement Learning for Robotic Hand Manipulation
Muhammed Saeed ... Mohammed Nagdi
-
Muhammed Saeed, et. al.Muhammed Saeed ... Mohammed Nagdi
26 Feb 2021
26 Feb 2021

Exploration in Continuous Control Tasks via Continually Parameterized Skills
Michael Dann ... Fabio Zambetta
IEEE Transactions on Games | VOL. 10
Michael Dann, et. al.Michael Dann ... Fabio Zambetta
01 Dec 2018
IEEE Transactions on Games | VOL. 10

The Effect of An Internal Model on Subsequent Learning
Ray E Eberts
Proceedings of the Human Factors Society Annual Meeting | VOL. 27
Ray E EbertsRay E Eberts
01 Oct 1983
Proceedings of the Human Factors Society Annual Meeting | VOL. 27

Anchor: The achieved goal to replace the subgoal for hierarchical reinforcement learning
Ruijia Li ... William Zhu
Knowledge-Based Systems | VOL. 225
Ruijia Li, et. al.Ruijia Li ... William Zhu
08 May 2021
Knowledge-Based Systems | VOL. 225

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Functionally Decomposed Hierarchies for Continuous Control Tasks With Path Planning

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters