Abstract

The increasing complexity of on-orbit tasks places great demands on the flexible operation of space robotic arms, driving the evolution of space robots from single-arm manipulation to multi-arm collaboration. In this paper, an approach combining Learning from Demonstration (LfD) and Reinforcement Learning (RL) is proposed for collaborative skill learning with space multi-arm systems. The combination addresses the trade-off in LfD between learning efficiency and solution feasibility, as well as the time-consuming search for an optimal solution in RL. With the prior knowledge provided by LfD, space robotic arms can achieve efficient guided learning in a high-dimensional state-action space. Specifically, an LfD stage based on Probabilistic Movement Primitives (ProMPs) is first used to encode and reproduce the demonstrated actions, generating a distribution that initializes the policy. In the RL stage, a Relative Entropy Policy Search (REPS) algorithm, modified for continuous state-action spaces, is then employed for further policy improvement. Importantly, the learned behaviors retain and reflect the characteristics of the demonstrations. In addition, a set of supplementary policy-search mechanisms is designed to accelerate exploration. The effectiveness of the proposed method is verified both theoretically and experimentally, and comparisons with state-of-the-art methods confirm that it outperforms them.
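The abstract outlines a two-stage pipeline: a ProMP fit to demonstrations yields a Gaussian distribution over trajectory weights, which REPS then refines under a relative-entropy (KL) constraint. The sketch below is only a minimal illustration of that structure, not the paper's implementation: it assumes a single joint trajectory, radial-basis features, and an episodic (black-box) REPS variant rather than the paper's continuous state-action formulation; `demos` and `rollout_return` are hypothetical placeholders.

# Hedged sketch: ProMP-initialized Gaussian search distribution refined by
# episodic Relative Entropy Policy Search (REPS). The data interface
# (`demos`, `rollout_return`) and basis-function setup are illustrative
# assumptions, not taken from the paper.
import numpy as np
from scipy.optimize import minimize

def rbf_features(t, n_basis=10, width=0.02):
    """Normalized radial basis functions over phase t in [0, 1]."""
    centers = np.linspace(0, 1, n_basis)
    phi = np.exp(-(t[:, None] - centers[None, :]) ** 2 / (2 * width))
    return phi / phi.sum(axis=1, keepdims=True)

def fit_promp(demos, n_basis=10, reg=1e-6):
    """Fit a ProMP weight distribution (mean, covariance) from demo trajectories."""
    weights = []
    for traj in demos:                      # traj: (T,) array for one joint
        t = np.linspace(0, 1, len(traj))
        Phi = rbf_features(t, n_basis)
        # Ridge regression of the trajectory onto the basis functions
        w = np.linalg.solve(Phi.T @ Phi + reg * np.eye(n_basis), Phi.T @ traj)
        weights.append(w)
    W = np.array(weights)
    return W.mean(axis=0), np.cov(W.T) + reg * np.eye(n_basis)

def reps_update(mu, Sigma, rollout_return, n_samples=50, epsilon=0.5):
    """One episodic REPS step: sample weights, reweight by soft-max of returns."""
    samples = np.random.multivariate_normal(mu, Sigma, size=n_samples)
    R = np.array([rollout_return(w) for w in samples])
    R_c = R - R.max()                       # shift returns for numerical stability
    # Dual of the KL-constrained objective, optimized over temperature eta > 0
    dual = lambda eta: eta * epsilon + eta * np.log(np.mean(np.exp(R_c / eta))) + R.max()
    eta = minimize(lambda x: dual(x[0]), x0=[1.0], bounds=[(1e-6, None)]).x[0]
    d = np.exp(R_c / eta)                   # per-sample weights
    mu_new = d @ samples / d.sum()
    diff = samples - mu_new
    Sigma_new = (d[:, None] * diff).T @ diff / d.sum() + 1e-6 * np.eye(len(mu))
    return mu_new, Sigma_new

A caller would start from the demonstration prior and iterate, e.g. mu, Sigma = fit_promp(demos) followed by repeated mu, Sigma = reps_update(mu, Sigma, rollout_return); because each update is a bounded KL step away from the previous distribution, the refined policy stays close to the demonstrated behavior, which is the property the abstract highlights.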
