Abstract
Continuous robotic control with sparse rewards is a longstanding challenge in deep reinforcement learning (DRL). While existing DRL algorithms have made great progress in learning policies from visual observations, learning an effective policy still requires an impractical number of real-world data samples. Moreover, some robotic tasks are naturally specified with sparse rewards, which makes the already scarce data even less efficient to learn from and slows training to the point of infeasibility. In addition, manually shaping reward functions is complex work, because it requires domain-specific knowledge and human intervention. To alleviate these issues, this paper proposes a model-free, off-policy DRL approach named TD3MHER to learn manipulation policies for continuous robotic tasks with sparse rewards. Specifically, TD3MHER combines the Twin Delayed Deep Deterministic policy gradient algorithm (TD3) with Model-driven Hindsight Experience Replay (MHER) to achieve highly sample-efficient training: while the agent is learning its policy, TD3MHER also helps it learn an approximate physical model of the robot that is useful for solving the task, and it does not require any additional robot-environment interactions. The performance of TD3MHER is assessed on a simulated robotic task with a 7-DOF manipulator, comparing the proposed technique against a previous DRL algorithm to verify its usefulness. Experimental results on the simulated robotic task show that the proposed approach successfully exploits previously stored samples with sparse rewards and achieves faster learning.
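To make the combination concrete, the sketch below illustrates the general idea of hindsight relabeling augmented with a learned dynamics model: stored transitions are relabeled with goals actually achieved later in the episode, and the learned model synthesizes extra transitions without new robot-environment interaction. This is only a minimal illustration of the technique, not the paper's exact algorithm; the names `dynamics_model`, `goal_from_state`, `relabel_episode`, and `GOAL_TOLERANCE` are assumed placeholders, and a sparse reward of 0 on success / -1 otherwise is assumed.

```python
# Minimal sketch of model-driven hindsight relabeling. All names and the
# relabeling scheme are illustrative assumptions, not the paper's exact method.
import numpy as np

GOAL_TOLERANCE = 0.05  # assumed distance threshold for "goal achieved"

def sparse_reward(achieved_goal, desired_goal):
    """Sparse reward: 0 if the achieved goal is within tolerance, else -1."""
    return 0.0 if np.linalg.norm(achieved_goal - desired_goal) < GOAL_TOLERANCE else -1.0

def relabel_episode(episode, dynamics_model, goal_from_state, k_future=4, seed=0):
    """Augment one stored episode with hindsight goals and model-generated data.

    episode         -- list of dicts with keys: state, action, achieved_goal
    dynamics_model  -- learned forward model: (state, action) -> predicted next state
    goal_from_state -- extracts the achieved goal (e.g. gripper position) from a state
    """
    rng = np.random.default_rng(seed)
    augmented = []
    T = len(episode)
    for t, step in enumerate(episode):
        # Standard "future" HER strategy: pretend a goal achieved later in the
        # episode was the desired goal all along, turning failures into successes.
        for f in rng.integers(t, T, size=min(k_future, T - t)):
            new_goal = episode[f]["achieved_goal"]
            augmented.append({
                "state": step["state"], "action": step["action"], "goal": new_goal,
                "reward": sparse_reward(step["achieved_goal"], new_goal),
            })
        # Model-driven augmentation: roll the learned dynamics one step forward
        # to synthesize a virtual transition for the same stored action,
        # requiring no new robot-environment interaction.
        virtual_next = dynamics_model(step["state"], step["action"])
        final_goal = episode[-1]["achieved_goal"]
        augmented.append({
            "state": step["state"], "action": step["action"], "goal": final_goal,
            "reward": sparse_reward(goal_from_state(virtual_next), final_goal),
        })
    return augmented
```

In a TD3-based pipeline, the relabeled and model-generated transitions would simply be pushed into the off-policy replay buffer alongside the original ones, which is what makes the combination sample-efficient: each real interaction yields several informative training samples instead of one uninformative sparse-reward sample.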