Intrinsically Motivated Multi-Goal Reinforcement Learning Using Robotics Environment Integrated with OpenAI Gym

Sivasubramanian Balasubramanian

doi:10.55662/jst.2023.4502

Abstract

Sparse reward is one of the most challenging problems in reinforcement learning (RL). Hindsight Experience Replay (HER) attempts to address this issue by converting a failed experience to a successful one by relabelling the goals. In open-ended and changing environments, agents face a wide range of potential tasks that might not come with associated reward functions. Such autonomous learning agents must set their own tasks and build their own curriculum through an intrinsically motivated exploration. Because some tasks might prove easy and some impossible, agents must actively select which task to practice at any given moment, to maximize their overall mastery on the set of learnable tasks. The purpose of this technical report is two-fold. First, it introduces a suite of challenging continuous control tasks (integrated with OpenAI Gym) based on currently existing robotics hardware. The tasks include pushing, sliding and pick & place with a Fetch robotic arm as well as in-hand object manipulation with a Shadow Dexterous Hand. All tasks have sparse binary rewards and follow a Multi-Goal Reinforcement Learning (RL) framework in which an agent is told what to do using an additional input. The second part of the paper presents a set of concrete research ideas for improving RL algorithms, most of which are related to Multi-Goal RL and Hindsight Experience Replay. The Fetch environments are based on the 7-DoF Fetch robotics arm,2 which has a two-ﬁngered parallel gripper. Agents focus on achievable tasks first and focus back on tasks that are being forgotten. Experiments conducted in a new multi-task multi-goal robotic environment show that our algorithm benefits from these two ideas and demonstrate properties of robustness to distracting tasks, forgetting and changes in body properties

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Intrinsically Motivated Multi-Goal Reinforcement Learning Using Robotics Environment Integrated with OpenAI Gym

Abstract

Talk to us

Similar Papers

More From: Journal of Science & Technology

Lead the way for us

Journal: Journal of Science & Technology	Publication Date: Nov 17, 2023
License type: CC BY-NC-SA 4.0

Similar Papers

Hindsight Balanced Reward Shaping
Mengxuan Shao ... Kun Han
-
Mengxuan Shao, et. al.Mengxuan Shao ... Kun Han
01 Jan 2023
01 Jan 2023

Overcoming Exploration in Reinforcement Learning with Demonstrations
Ashvin Nair ... Marcin Andrychowicz
-
Ashvin Nair, et. al.Ashvin Nair ... Marcin Andrychowicz
01 May 2018
01 May 2018

Curiosity-Driven Exploration for Off-Policy Reinforcement Learning Methods
Boyao Li ... Jiayi Li
-
Boyao Li, et. al.Boyao Li ... Jiayi Li
01 Dec 2019
01 Dec 2019

Impulsive maneuver strategy for multi-agent orbital pursuit-evasion game under sparse rewards
Hongbo Wang ... Yao Zhang
Aerospace Science and Technology | VOL. 155
Hongbo Wang, et. al.Hongbo Wang ... Yao Zhang
29 Sep 2024
Aerospace Science and Technology | VOL. 155

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Intrinsically Motivated Multi-Goal Reinforcement Learning Using Robotics Environment Integrated with OpenAI Gym

Abstract

Talk to us

Similar Papers

More From: Journal of Science &amp; Technology

More From: Journal of Science & Technology