A Data-Efficient Framework for Training and Sim-to-Real Transfer of Navigation Policies

Homanga Bharadhwaj, Yoshua Bengio, Zihan Wang, Liam Paull

doi:10.1109/icra.2019.8794310

Abstract

Learning effective visuomotor policies for robots purely from data is challenging, but also appealing, since a learning-based system should not require manual tuning or calibration. For a robot operating in a real environment, the training process can be costly, time-consuming, and even dangerous, since failures are common at the start of training. For this reason, it is desirable to leverage simulation and off-policy data as much as possible when training the robot. In this work, we introduce a robust framework that plans in simulation and transfers well to the real environment. Our model incorporates a gradient-descent-based planning module which, given the initial image and goal image, encodes the images into a lower-dimensional latent state and plans a trajectory to reach the goal. The model, consisting of the encoder and planner modules, is first trained through a meta-learning strategy in simulation. We then perform adversarial domain transfer on the encoder, using a bank of unlabelled, randomly sampled images from the simulation and real environments, so that the encoder maps real and simulated images to similarly distributed latent representations. By fine-tuning the entire model (encoder + planner) with only a few real-world expert demonstrations, we show successful planning performance on different navigation tasks.
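The planning step described in the abstract can be illustrated with a toy example. This is only a minimal sketch under strong assumptions, not the authors' implementation: the encoder is omitted (the latent start and goal states are hard-coded stand-ins), the latent dynamics are assumed to be the hypothetical additive model z_{t+1} = z_t + a_t rather than a learned network, and the names `rollout`, `plan_actions`, and `squared_distance` are invented for illustration. It shows how an action sequence can be refined by gradient descent on the distance between the predicted final latent state and the goal latent state.

```python
def rollout(z0, actions):
    """Roll the ASSUMED dynamics z_{t+1} = z_t + a_t forward; return the final latent state."""
    z = list(z0)
    for a in actions:
        z = [zi + ai for zi, ai in zip(z, a)]
    return z


def squared_distance(u, v):
    """Squared Euclidean distance between two latent vectors."""
    return sum((ui - vi) ** 2 for ui, vi in zip(u, v))


def plan_actions(z0, z_goal, horizon=5, lr=0.05, steps=50):
    """Gradient-descent planning: minimise ||z_T - z_goal||^2 over the action sequence."""
    dim = len(z0)
    actions = [[0.0] * dim for _ in range(horizon)]  # start from a zero plan
    for _ in range(steps):
        z_final = rollout(z0, actions)
        # For the assumed additive dynamics, d(loss)/d(a_t) = 2 * (z_T - z_goal) for every t.
        grad = [2.0 * (zf - zg) for zf, zg in zip(z_final, z_goal)]
        actions = [[ai - lr * gi for ai, gi in zip(a, grad)] for a in actions]
    return actions


z_start = [0.0, 0.0]   # hypothetical stand-in for encoder(initial image)
z_goal = [1.0, -2.0]   # hypothetical stand-in for encoder(goal image)
plan = plan_actions(z_start, z_goal)
final_error = squared_distance(rollout(z_start, plan), z_goal)
```

In a learned system the hand-derived gradient would presumably be replaced by automatic differentiation through the learned dynamics model, and the encoder would supply `z_start` and `z_goal` from images; both are left out here to keep the sketch self-contained.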
