Learning From Imperfect Demonstrations From Agents With Varying Dynamics

Zhangjie Cao,Dorsa Sadigh

doi:10.1109/lra.2021.3068912

Zhangjie Cao, Dorsa Sadigh

Open Access

https://doi.org/10.1109/lra.2021.3068912

Copy DOI

Journal: IEEE Robotics and Automation Letters	Publication Date: Mar 31, 2021
Citations: 31	License type: publisher-specific-oa

Affiliation: Stanford University

Abstract

Imitation learning enables robots to learn from demonstrations. Previous imitation learning algorithms usually assume access to optimal expert demonstrations. However, in many real-world applications, this assumption is limiting. Most collected demonstrations are not optimal or are produced by an agent with slightly different dynamics. We therefore address the problem of imitation learning when the demonstrations can be sub-optimal or be drawn from agents with varying dynamics. We develop a metric composed of a feasibility score and an optimality score to measure how useful a demonstration is for imitation learning. The proposed score enables learning from more informative demonstrations, and disregarding the less relevant demonstrations. Our experiments on four environments in simulation and on a real robot show improved learned policies with higher expected return.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning From Imperfect Demonstrations From Agents With Varying Dynamics

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters

Lead the way for us

Similar Papers

Survey of imitation learning: tradition and new advances
Chao Zhang ... Weijie Liu
Journal of Image and Graphics | VOL. 28
Chao Zhang, et. al.Chao Zhang ... Weijie Liu
01 Jan 2023
Journal of Image and Graphics | VOL. 28

Comparison of Control Methods Based on Imitation Learning for Autonomous Driving
Yinfeng Gao ... Zhonghua Pang
-
Yinfeng Gao, et. al.Yinfeng Gao ... Zhonghua Pang
01 Dec 2019
01 Dec 2019

COLLECT AND PREPARE DATASET FROM HUMAN-DEMONSTRATIONS OF SIMPLE MANIPULATION TASKS USING XARM7 ROBOTIC ARM FOR TRAINING BEHAVIOUR CLONING ALGORITHM
Юрій Кривенчук ... Максим Шаварський
Herald of Khmelnytskyi National University. Technical sciences | VOL. 333
Юрій Кривенчук, et. al.Юрій Кривенчук ... Максим Шаварський
25 Apr 2024
Herald of Khmelnytskyi National University. Technical sciences | VOL. 333

Sample-Efficient I-Projections for Robot Learning

-

19 Apr 2021
19 Apr 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning From Imperfect Demonstrations From Agents With Varying Dynamics

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters