Robotic Control in Adversarial and Sparse Reward Environments: A Robust Goal-Conditioned Reinforcement Learning Approach

Xiangkun He,Chen Lv

doi:10.1109/tai.2023.3237665

Abstract

With function approximators based on deep neural networks, reinforcement learning holds the promise of learning complex end-to-end robotic controllers that can map high-dimensional sensory information directly to control policies. However, a common challenge, especially in robotics, is sample-efficient learning from sparse rewards, in which an agent must find a long sequence of “correct” actions to achieve a desired outcome. Unavoidable perturbations on observations can make this task even harder to solve. This paper proposes a novel robust goal-conditioned reinforcement learning approach for end-to-end robotic control in adversarial and sparse reward environments. Specifically, a mixed adversarial attack scheme is presented that generates diverse adversarial perturbations on observations by combining white-box and black-box attacks. In addition, a hindsight experience replay technique that accounts for observation perturbations is developed to turn a failed experience into a successful one and to generate the policy trajectories perturbed by the mixed adversarial attacks. A robust goal-conditioned actor-critic method is then proposed to learn goal-conditioned policies while keeping the variations of the perturbed policy trajectories within bounds. Finally, the proposed method is evaluated on three tasks with adversarial attacks and sparse reward settings. The results indicate that the scheme can maintain robotic control performance and policy robustness on the adversarial and sparse reward tasks.
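
To make the hindsight idea in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It relabels a failed episode with the goal that was actually reached, so the final step counts as a success, and it perturbs the stored observations within a bound. Everything named here is an assumption made for illustration: the `Transition` record, the 0/-1 reward convention, the "final" relabeling strategy, and in particular the uniform random noise in `perturb`, which only stands in for the paper's mixed white-box and black-box attack.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple

Vector = Tuple[float, ...]


@dataclass(frozen=True)
class Transition:
    """One step of a goal-conditioned episode (hypothetical record layout)."""
    obs: Vector
    action: Vector
    next_obs: Vector
    achieved_goal: Vector  # goal-space position reached after the action
    goal: Vector           # goal the policy was conditioned on
    reward: float


def sparse_reward(achieved: Vector, goal: Vector, tol: float = 0.05) -> float:
    """Return 0.0 on success and -1.0 otherwise (assumed sparse convention)."""
    dist = sum((a - g) ** 2 for a, g in zip(achieved, goal)) ** 0.5
    return 0.0 if dist <= tol else -1.0


def perturb(obs: Vector, eps: float, rng: random.Random) -> Vector:
    """Add noise bounded by eps per coordinate.

    Assumption: uniform noise is a placeholder for an adversarial attack;
    it is not the mixed attack scheme described in the abstract.
    """
    return tuple(o + rng.uniform(-eps, eps) for o in obs)


def relabel_with_final_goal(
    episode: List[Transition], eps: float, rng: random.Random
) -> List[Transition]:
    """Relabel every step with the goal reached at the end of the episode.

    Rewards are recomputed against the new goal, and the stored
    observations are replaced by their perturbed versions.
    """
    new_goal = episode[-1].achieved_goal
    relabeled = []
    for t in episode:
        relabeled.append(
            Transition(
                obs=perturb(t.obs, eps, rng),
                action=t.action,
                next_obs=perturb(t.next_obs, eps, rng),
                achieved_goal=t.achieved_goal,
                goal=new_goal,
                reward=sparse_reward(t.achieved_goal, new_goal),
            )
        )
    return relabeled
```

In this sketch the reward is computed from the unperturbed `achieved_goal`, so the perturbation affects only what the policy would observe, not whether a step counts as a success. That split is a design choice of the sketch rather than something the abstract specifies.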

Journal: IEEE Transactions on Artificial Intelligence
Publication Date: Jan 1, 2024
Citations: 7
License type: cc-by

Similar Papers

Accelerated Robot Learning via Human Brain Signals
Iretiayo Akinola ... Paul Sajda
01 May 2020

Robot controller architecture for user friendly application deployment
Christian Richter ... Tim C Lueth
01 Dec 2010

Data-efficient Deep Reinforcement Learning Method Toward Scaling Continuous Robotic Task with Sparse Rewards
Junkai Ren ... Yixing Lan
15 Jul 2021

SAEIR: Sequentially Accumulated Entropy Intrinsic Reward for Cooperative Multi-Agent Reinforcement Learning with Sparse Reward
Xin He ... Hongwei Ge
01 Aug 2024
