End-to-end nonprehensile rearrangement with deep reinforcement learning and simulation-to-reality transfer

Weihao Yuan,Kaiyu Hang,Danica Kragic,Michael Y Wang,Johannes A Stork

doi:10.1016/j.robot.2019.06.007

Abstract

Nonprehensile rearrangement is the problem of controlling a robot to interact with objects through pushing actions in order to reconfigure the objects into a predefined goal pose. In this work, we rearrange one object at a time in an environment with obstacles using an end-to-end policy that maps raw pixels as visual input to control actions without any form of engineered feature extraction. To reduce the amount of training data that needs to be collected using a real robot, we propose a simulation-to-reality transfer approach. In the first step, we model the nonprehensile rearrangement task in simulation and use deep reinforcement learning to learn a suitable rearrangement policy, which requires in the order of hundreds of thousands of example actions for training. Thereafter, we collect a small dataset of only 70 episodes of real-world actions as supervised examples for adapting the learned rearrangement policy to real-world input data. In this process, we make use of newly proposed strategies for improving the reinforcement learning process, such as heuristic exploration and the curation of a balanced set of experiences. We evaluate our method in both simulation and real setting using a Baxter robot to show that the proposed approach can effectively improve the training process in simulation, as well as efficiently adapt the learned policy to the real world application, even when the camera pose is different from simulation. Additionally, we show that the learned system not only can provide adaptive behavior to handle unforeseen events during executions, such as distraction objects, sudden changes in positions of the objects, and obstacles, but also can deal with obstacle shapes that were not present in the training process.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

End-to-end nonprehensile rearrangement with deep reinforcement learning and simulation-to-reality transfer

Abstract

Talk to us

Similar Papers

More From: Robotics and Autonomous Systems

Lead the way for us

Journal: Robotics and Autonomous Systems	Publication Date: Jul 4, 2019
Citations: 44

Similar Papers

Sample effficient deep reinforcement learning for control

-

15 Dec 2019
15 Dec 2019

Deep Reinforcement Learning: A New Frontier in Computer Vision Research
Sejuti Rahman ... Sujan Sarker
-
Sejuti Rahman, et. al.Sejuti Rahman ... Sujan Sarker
01 Jan 2020
01 Jan 2020

Deep reinforcement learning and its applications in medical imaging and radiation therapy: a survey
Lanyu Xu ... Ning Wen
Physics in Medicine & Biology | VOL. 67
Lanyu Xu, et. al.Lanyu Xu ... Ning Wen
11 Nov 2022
Physics in Medicine & Biology | VOL. 67

Multi-granularity coverage criteria for deep reinforcement learning systems
Ying Shi ... Zheng Zheng
The Journal of Systems & Software | VOL. 212
Ying Shi, et. al.Ying Shi ... Zheng Zheng
11 Mar 2024
The Journal of Systems & Software | VOL. 212

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

End-to-end nonprehensile rearrangement with deep reinforcement learning and simulation-to-reality transfer

Abstract

Talk to us

Similar Papers

More From: Robotics and Autonomous Systems