Self-Supervised Correspondence in Visuomotor Policy Learning

Peter Florence,Lucas Manuelli,Russ Tedrake

doi:10.1109/lra.2019.2956365

Peter Florence, Lucas Manuelli + Show 1 more

Open Access

https://doi.org/10.1109/lra.2019.2956365

Copy DOI

Journal: IEEE Robotics and Automation Letters	Publication Date: Dec 5, 2019
Citations: 97	License type: publisher-specific, author manuscript

Affiliation: Massachusetts Institute of Technology

Abstract

In this letter, we explore using self-supervised correspondence for improving the generalization performance and sample efficiency of visuomotor policy learning. Prior work has primarily used approaches such as autoencoding, pose-based losses, and end-to-end policy optimization in order to train the visual portion of visuomotor policies. We instead propose an approach using self-supervised dense visual correspondence training and show that this enables visuomotor policy learning with surprisingly high generalization performance with modest amounts of data. Using imitation learning, we demonstrate extensive hardware validation on challenging manipulation tasks with as few as 50 demonstrations. Our learned policies can generalize across classes of objects, react to deformable object configurations, and manipulate textureless symmetrical objects in a variety of backgrounds, all with closed-loop, real-time vision-based policies. Simulated imitation learning experiments suggest that correspondence training offers sample complexity and generalization benefits compared to autoencoding and end-to-end training.

Full Text