Compound Heuristic Information Guided Policy Improvement for Robot Motor Skill Acquisition

Jian Fu,Boqun Li,Xiang Teng,Fan Luo,Cong Li

doi:10.3390/app10155346

Abstract

Discovering the implicit pattern and using it as heuristic information to guide the policy search is one of the core factors to speed up the procedure of robot motor skill acquisition. This paper proposes a compound heuristic information guided reinforcement learning algorithm PI2-CMA-KCCA for policy improvement. Its structure and workflow are similar to a double closed-loop control system. The outer loop realized by Kernel Canonical Correlation Analysis (KCCA) infers the implicit nonlinear heuristic information between the joints of the robot. In addition, the inner loop operated by Covariance Matrix Adaptation (CMA) discovers the hidden linear correlations between the basis functions within the joint of the robot. These patterns which are good for learning the new task can automatically determine the mean and variance of the exploring perturbation for Path Integral Policy Improvement (PI2). Compared with classical PI2, PI2-CMA, and PI2-KCCA, PI2-CMA-KCCA can not only endow the robot with the ability to realize transfer learning of trajectory planning from the demonstration to the new task, but also complete it more efficiently. The classical via-point experiments based on SCARA and Swayer robots have validated that the proposed method has fast learning convergence and can find a solution for the new task.

Highlights

Imitation learning (IL) and reinforcement learning (RL) [1] have always been a hot topic in the field of robot skill acquisition
The combination of IL and RL aims to use the advantages of two methods to overcome their respective shortcomings, so that the robot can adapt to the deviation from the demonstration behavior, so as to improve the performance of the robot
Together with our previous research on Kernel Canonical Correlation Analysis (KCCA) [12], we propose a new algorithm PI2 -Covariance Matrix Adaptation (CMA)-KCCA in this paper, where KCCA and CMA are integrated as compound heuristic information to speed up the learning procedure from the demonstration to a new task

Summary

Introduction

Imitation learning (IL) and reinforcement learning (RL) [1] have always been a hot topic in the field of robot skill acquisition. When the reproduction environment is different from the demonstration environment or there is a big deviation, such as placing an obstacle on the path of the robot, the imitation learning method may fail. Together with our previous research on KCCA [12], we propose a new algorithm PI2 -CMA-KCCA in this paper, where KCCA and CMA are integrated as compound heuristic information to speed up the learning procedure from the demonstration to a new task.

Dynamic Movement Primitives

Path Integral Policy Improvement with Covariance Matrix Adaption

PI2 -CMA with Kernel Canonical Correlation Analysis

Nonlinear Correlation Heuristic Information

Robot Intelligent Trajectory Inference with KCCA

The Combination of KCCA and CMA

Evaluations

Passing through One Via-Point with SCARA

Passing through Two Via-Point with SCARA

Passing Through One Via-Point with Swayer

Performance Comparison of Four Algorithms

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Aug 3, 2020
Citations: 4	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Compound Heuristic Information Guided Policy Improvement for Robot Motor Skill Acquisition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

New feature extraction method and its application to pattern recognition
Zong-Li Liu ... Yuan-Hong Hao
Journal of Computer Applications | VOL. 29
Zong-Li Liu, et. al.Zong-Li Liu ... Yuan-Hong Hao
27 May 2009
Journal of Computer Applications | VOL. 29

A JPEG image blind steganography detection method using KCCA feature fusion
Jian Yang ... Shang-Ping Zhong
-
Jian Yang, et. al.Jian Yang ... Shang-Ping Zhong
01 Jul 2012
01 Jul 2012

Nonlinear feature selection based on hybrid KCCA-FNN algorithm for modeling
Jun Yi ... Su Yingying
-
Jun Yi, et. al.Jun Yi ... Su Yingying
01 May 2011
01 May 2011

Isointense Infant Brain Segmentation by Stacked Kernel Canonical Correlation Analysis.
Li Wang ... Weili Lin
Patch-based techniques in medical imaging : First International Workshop, Patch-MI 2015, held in conjunction with MICCAI 2015, Munich, Germany, October 9, 2015, revised selected papers. Patch-MI (Workshop) (1st : 2015 : Munich, Germany) | VOL. 9467
Li Wang, et. al.Li Wang ... Weili Lin
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Compound Heuristic Information Guided Policy Improvement for Robot Motor Skill Acquisition

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences