Abstract

Deep Reinforcement Learning (RL) is a promising approach for adaptive robot control, but its application to robotics is currently hindered by high sample requirements. To alleviate this issue, we propose to exploit the symmetries present in robotic tasks. Intuitively, symmetries of observed trajectories define transformations that leave the space of feasible RL trajectories invariant, and can therefore be used to generate new feasible trajectories for training. Based on this data augmentation idea, we formulate a general framework, called Invariant Transform Experience Replay, which we instantiate with two techniques: (i) Kaleidoscope Experience Replay, which exploits reflectional symmetries, and (ii) Goal-Augmented Experience Replay, which takes advantage of lax goal definitions. On the Fetch tasks from OpenAI Gym, our experimental results show significant increases in learning speed and success rates. In particular, we attain 13-, 3-, and 5-fold speedups on the pushing, sliding, and pick-and-place tasks, respectively, in the multi-goal setting. Performance gains are also observed in similar tasks with obstacles, and we successfully deployed a trained policy on a real Baxter robot. Our work demonstrates that invariant transformations of RL trajectories are a promising methodology for speeding up learning in deep RL.
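
The core idea, generating new feasible transitions by applying symmetry transformations to observed ones, can be illustrated in a few lines. The following Python sketch is ours, not the paper's implementation: it assumes a simplified transition whose observation, action, and goal are single (x, y, z) vectors in a robot frame whose x-z plane is a plane of reflectional symmetry (real Fetch observations are higher-dimensional and would need index-wise handling), and it assumes a sparse distance-threshold reward for the goal relabeling. The names `augment_transition`, `augment_goal`, and the tolerance `eps` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Reflection across the x-z plane: negate the y component.
REFLECT_Y = np.array([1.0, -1.0, 1.0])

def reflect(vec3):
    """Reflect a 3-D point or vector across the x-z plane of the robot frame."""
    return vec3 * REFLECT_Y

def augment_transition(t):
    """KER-style augmentation: reflect an observed transition to obtain a new
    feasible one. The reward is unchanged because goal-conditioned rewards
    depend only on relative distances, which reflections preserve."""
    return {
        "obs":      reflect(t["obs"]),
        "action":   reflect(t["action"]),
        "next_obs": reflect(t["next_obs"]),
        "goal":     reflect(t["goal"]),
        "reward":   t["reward"],
    }

def augment_goal(t, eps=0.05):
    """GER-style augmentation: relabel the transition with a goal sampled
    inside the success tolerance `eps` (a hypothetical value) around the
    achieved position, so a sparse distance-threshold reward registers
    success by construction. In this simplified layout, the next observation
    serves directly as the achieved position."""
    direction = rng.normal(size=3)
    direction /= np.linalg.norm(direction)
    new_goal = t["next_obs"] + direction * eps * rng.uniform()  # point in the eps-ball
    relabeled = dict(t)
    relabeled["goal"] = new_goal
    relabeled["reward"] = 0.0  # success under the sparse reward convention
    return relabeled

# Usage: store the original transition and its augmented copies in the replay buffer.
t = {"obs": np.array([0.5, 0.2, 0.1]), "action": np.array([0.0, -0.1, 0.0]),
     "next_obs": np.array([0.5, 0.1, 0.1]), "goal": np.array([0.6, -0.3, 0.1]),
     "reward": -1.0}
replay_buffer = [t, augment_transition(t), augment_goal(t)]
```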
