Abstract

The inferior sample efficiency of reinforcement learning (RL) and the need for high-quality demonstrations in imitation learning (IL) hinder their application to real-world robots. To address this challenge, a novel self-evolution framework, named task-oriented self-imitation learning (TOSIL), is proposed. To eliminate the need for external demonstrations, the top-K self-generated trajectories are selected as expert data from both per-episode exploration and long-term return perspectives. Each transition is assigned a guide reward formulated from these trajectories. The guide rewards are updated as the agent evolves, encouraging productive exploration behaviors. This ensures that the agent explores in task-relevant directions, improving sample efficiency and asymptotic performance. Experimental results on locomotion and manipulation tasks indicate that the proposed framework outperforms other state-of-the-art RL methods. Furthermore, integrating suboptimal trajectories can improve sample efficiency while maintaining performance. This is a significant advancement in autonomous skill acquisition for robots.
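To make the self-imitation mechanism concrete, the sketch below shows one plausible way to maintain a buffer of the top-K self-generated episodes and derive a guide reward from it. This is a minimal illustration only: the class name, the exponential distance-based bonus, and the `scale` parameter are assumptions for demonstration, since the abstract does not specify the paper's actual reward formulation.

```python
import heapq
import numpy as np

class TopKSelfImitationBuffer:
    """Keeps the K highest-return self-generated episodes as pseudo-expert data.

    The guide_reward below is a stand-in: it scores a transition by its
    proximity to states stored in the top-K buffer, so the bonus shifts
    as better episodes replace older ones (i.e., as the agent evolves).
    """

    def __init__(self, k=10):
        self.k = k
        self._heap = []      # min-heap of (episode_return, counter, states)
        self._counter = 0    # tie-breaker so numpy arrays are never compared

    def add_episode(self, states, episode_return):
        """Insert an episode of shape (T, state_dim); keep only the K best."""
        item = (episode_return, self._counter, np.asarray(states))
        self._counter += 1
        if len(self._heap) < self.k:
            heapq.heappush(self._heap, item)
        elif episode_return > self._heap[0][0]:
            heapq.heapreplace(self._heap, item)  # drop the worst stored episode

    def guide_reward(self, state, scale=0.1):
        """Bonus that grows as the state approaches the nearest stored state."""
        if not self._heap:
            return 0.0
        expert_states = np.concatenate([ep for _, _, ep in self._heap], axis=0)
        dists = np.linalg.norm(expert_states - np.asarray(state), axis=1)
        return scale * float(np.exp(-dists.min()))
```

In a training loop, the guide reward would typically be added to the environment reward for each transition, while completed episodes are fed back into the buffer so that the pseudo-expert set keeps improving with the agent.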
