Abstract

Several approaches have been proposed to assist humans in co-manipulation and teleoperation tasks given demonstrated trajectories. However, these approaches are not applicable when the demonstrations are suboptimal or when the generalization capabilities of the learned models cannot cope with changes in the environment. Nevertheless, in real co-manipulation and teleoperation tasks, the original demonstrations will often be suboptimal, and a learning system must be able to cope with new situations. This paper presents a reinforcement learning algorithm that can be applied to such problems. The proposed algorithm is initialized with a probability distribution of demonstrated trajectories and is based on the concept of relevance functions. We show in this paper how the relevance of trajectory parameters to optimization objectives is connected to the Pearson correlation coefficient. First, we demonstrate the efficacy of our algorithm on the assisted teleoperation of an object in a static virtual environment. Afterward, we extend this algorithm to deal with dynamic environments using Gaussian Process regression. The full framework is applied to make a point particle and a 7-DoF robot arm autonomously adapt their movements to changes in the environment, as well as to assist the teleoperation of a 7-DoF robot arm in a dynamic environment.

Highlights

  • Learning from demonstrations is a promising approach toward human-robot co-manipulation and teleoperation

  • We extend PRO with Gaussian Process (GP) regression to cope with dynamic environments

  • To adapt Probabilistic Movement Primitives (ProMPs) on the fly to changes in the environment, our learning system must be able to compute these ProMPs quickly. To address this challenge, we propose using Gaussian Process (GP) regression to map variables describing the environment to the mean vector μ_w and covariance matrix Σ_w of a ProMP (see the sketch after this list)
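A minimal sketch of this mapping, assuming scikit-learn's GaussianProcessRegressor; the training data, context dimensions, parameter dimension D, and the upper-triangular vectorization of Σ_w are illustrative assumptions, not the authors' implementation:

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# Hypothetical offline training set: each row of contexts describes the
# environment (e.g., an obstacle position); each row of promp_params holds the
# corresponding flattened ProMP parameters [mu_w, upper triangle of Sigma_w].
rng = np.random.default_rng(0)
contexts = rng.random((50, 2))        # (N, context_dim)
promp_params = rng.random((50, 9))    # (N, D + D*(D+1)//2) with D = 3

gp = GaussianProcessRegressor(kernel=RBF(length_scale=1.0), normalize_y=True)
gp.fit(contexts, promp_params)

# At runtime, a single fast GP prediction yields the ProMP for the current
# environment, enabling on-the-fly adaptation of the trajectory distribution.
D = 3
pred = gp.predict(np.array([[0.3, 0.7]]))[0]
mu_w = pred[:D]
Sigma_w = np.zeros((D, D))
iu = np.triu_indices(D)
Sigma_w[iu] = pred[D:]
Sigma_w = Sigma_w + Sigma_w.T - np.diag(np.diag(Sigma_w))
# In practice the predicted Sigma_w may need a projection onto the nearest
# positive semi-definite matrix before sampling trajectories from it.
```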

Summary

INTRODUCTION

Learning from demonstrations is a promising approach toward human-robot co-manipulation and teleoperation. Our work contributes to this field by providing a new reinforcement learning algorithm, Pearson-Correlation-Based Relevance Weighted Policy Optimization (PRO), to improve upon demonstrated trajectories when these are suboptimal or when solutions to new situations must be found. These trajectories need to be optimized with respect to objectives such as minimizing distances to via points, keeping a certain minimum distance from obstacles, and achieving minimal length or minimal jerk. The new algorithm presented in this paper, PRO, is based on the insight that the Pearson correlation coefficient (Benesty et al., 2009) can be used to determine how each trajectory parameter influences each objective. It does not require designing basis functions for the relevance. Excerpts of this work have been accepted for presentation (Ewerton et al., 2019).
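To make this connection concrete, the sketch below (hypothetical names; not the authors' implementation) estimates a relevance weight for each trajectory parameter and each objective by sampling parameters from the current trajectory distribution, evaluating the objectives, and taking the absolute Pearson correlation between each parameter and each objective value:

```python
import numpy as np

def relevance_weights(mu_w, Sigma_w, objectives, n_samples=200, seed=0):
    """Absolute Pearson correlation between each trajectory parameter and
    each objective, used as a relevance weight.

    mu_w:       (D,) mean of the trajectory-parameter distribution
    Sigma_w:    (D, D) covariance of the trajectory-parameter distribution
    objectives: callables mapping a parameter vector (D,) to a scalar cost
    Returns:    (len(objectives), D) matrix of relevance weights in [0, 1].
    """
    rng = np.random.default_rng(seed)
    W = rng.multivariate_normal(mu_w, Sigma_w, size=n_samples)  # (N, D)
    R = np.zeros((len(objectives), W.shape[1]))
    for i, obj in enumerate(objectives):
        costs = np.array([obj(w) for w in W])
        if costs.std() < 1e-12:
            continue  # constant objective: no parameter is relevant to it
        for d in range(W.shape[1]):
            R[i, d] = abs(np.corrcoef(W[:, d], costs)[0, 1])
    return R

# Example: a via-point-like objective that depends only on the first parameter
# should receive a high relevance weight for that parameter and near-zero
# weights for the others.
R = relevance_weights(np.zeros(3), np.eye(3), [lambda w: (w[0] - 1.0) ** 2])
```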

RELATED WORK
Relevance Functions
Optimization of Trajectory Distributions
ONLINE ADAPTATION OF TRAJECTORY DISTRIBUTIONS
EXPERIMENTS
Assisted Teleoperation of a Virtual Object
Adaptation in Dynamic Environments—Point Particle
Adaptation in Dynamic Environments—Autonomous Robot Arm
Teleoperation of a Robot Arm in a Dynamic Environment
CONCLUSION AND FUTURE WORK
DATA AVAILABILITY STATEMENT