HMRL

Yun Hua,Bo Jin,Wenhao Li,Xiaofeng He,Hongyuan Zha,Xiangfeng Wang,Junchi Yan

doi:10.1145/3447548.3467242

Abstract

In spite of the success of existing meta reinforcement learning methods, they still have difficulty in learning a meta policy effectively for RL problems with sparse reward. In this respect, we develop a novel meta reinforcement learning framework called Hyper-Meta RL(HMRL), for sparse reward RL problems. It is consisted with three modules including the cross-environment meta state embedding module which constructs a common meta state space to adapt to different environments; the meta state based environment-specific meta reward shaping which effectively extends the original sparse reward trajectory by cross-environmental knowledge complementarity and as a consequence the meta policy achieves better generalization and efficiency with the shaped meta reward. Experiments with sparse-reward environments show the superiority of HMRL on both transferability and policy learning efficiency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

HMRL

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Meta Reinforcement Learning with Generative Adversarial Reward from Expert Knowledge
Dongzi Wang ... Bo Ding
-
Dongzi Wang, et. al.Dongzi Wang ... Bo Ding
27 Sep 2020
27 Sep 2020

Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo ... Honglak Lee
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Yijie Guo, et. al.Yijie Guo ... Honglak Lee
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Data-efficient Deep Reinforcement Learning Method Toward Scaling Continuous Robotic Task with Sparse Rewards
Junkai Ren ... Yujun Zeng
-
Junkai Ren, et. al.Junkai Ren ... Yujun Zeng
15 Jul 2021
15 Jul 2021

Efficient hindsight reinforcement learning using demonstrations for robotic tasks with sparse rewards
Guoyu Zuo ... Jiangeng Li
International Journal of Advanced Robotic Systems | VOL. 17
Guoyu Zuo, et. al.Guoyu Zuo ... Jiangeng Li
01 Jan 2020
International Journal of Advanced Robotic Systems | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HMRL

Abstract

Talk to us

Similar Papers