RLUC: Strengthening robustness by attaching constraint considerations to policy network

Jianmin Tang,Quan Liu,Fanzhang Li,Fei Zhu

doi:10.1016/j.eswa.2023.121475

Abstract

Deep reinforcement learning is widely used in many fields. However, recent research has found vulnerabilities in agents trained by reinforcement learning algorithms and raised concerns about the deployment of agents in the real world. Due to the addition of imperceptible adversarial examples to the agent’s observed state, the policy network is tricked into acting in a suboptimal way. To solve this issue, we introduce an approach, named reinforcement learning under local constraints (RLUC), aimed at bolstering the robustness of agents when countering potent adversarial attacks. Considering the sensitivity of the policy network to the observation state, when adversarial samples are those perturbed by the adversaries injected into the observation states, the policy network generates large fluctuations in the last connection layer. In order to minimize the divergence in the distribution of policy outputs, our method attaches constraints at each layer of the policy network, allowing the agent to stick to the origin action under adversarial attacks. RLUC endeavor to minimize the total variance of the output of actor constructed by neural network layers between polluted state and clean state. RLUC is evaluated and analyzed in different aspects compared to previous excellent works. RLUC underwent evaluation on Mujoco benchmarks and was subjected to testing against six formidable adversarial attacks. Experiment results show that the proposed method outperforms existing state-of-the-art methods and significantly improves the robustness and smoothness of the agent’s policy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RLUC: Strengthening robustness by attaching constraint considerations to policy network

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Similar Papers

Offense and defence against adversarial sample: A reinforcement learning method in energy trading market
Donghe Li ... Xiao Liao
Frontiers in Energy Research | VOL. 10
Donghe Li, et. al.Donghe Li ... Xiao Liao
12 Jan 2023
Frontiers in Energy Research | VOL. 10

XSS adversarial example attacks based on deep reinforcement learning
Li Chen ... Tao Li
Computers & Security | VOL. 120
Li Chen, et. al.Li Chen ... Tao Li
10 Jul 2022
Computers & Security | VOL. 120

Adversarial Robustness of Deep Reinforcement Learning Based Dynamic Recommender Systems.
Siyu Wang ... Xiaocong Chen
Frontiers in big data | VOL. 5
Siyu Wang, et. al.Siyu Wang ... Xiaocong Chen
03 May 2022
Frontiers in big data | VOL. 5

Towards Robust Ensemble Defense Against Adversarial Examples Attack
Nag Mani ... Melody Moh
-
Nag Mani, et. al.Nag Mani ... Melody Moh
01 Dec 2019
01 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RLUC: Strengthening robustness by attaching constraint considerations to policy network

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications