Abstract

Deep Reinforcement Learning systems have attracted considerable attention in Machine Learning for their effectiveness in many complex tasks, but their application in safety-critical domains (e.g., robot control or autonomous driving) remains dangerous without mechanisms to detect and prevent risky situations. In Deep RL, such risk mostly takes the form of adversarial attacks, which introduce small perturbations into sensor inputs with the aim of altering network-based decisions and thereby causing catastrophic outcomes. In light of these dangers, a promising line of research is to equip Deep RL algorithms with suitable defenses, especially for deployment in real environments. This paper suggests that this line of research could benefit greatly from concepts developed in the existing research field of Safe Reinforcement Learning, a family of RL algorithms designed to provide defenses against many forms of risk. However, the connections between Safe RL and the design of defenses against adversarial attacks in Deep RL remain largely unexplored, and this paper explores some of them. In particular, it proposes to reuse concepts from existing Safe RL algorithms to create a novel and effective instance-based defense for the deployment stage of Deep RL policies. The proposed algorithm uses a risk function, based on how far a state lies from the state space known to the agent, to identify and prevent adversarial situations. The effectiveness of the proposed defense is evaluated on four Atari games.
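As a rough illustration of the kind of instance-based defense the abstract describes, the sketch below computes a risk score as the distance from an incoming observation to the nearest state in a buffer of states visited during training, and flags observations whose score exceeds a threshold. This is a minimal hypothetical sketch, not the paper's exact formulation: the class name, the Euclidean distance metric, and the threshold are all illustrative assumptions.

```python
import numpy as np


class InstanceBasedRiskDetector:
    """Hypothetical sketch of an instance-based risk function: a state is
    considered risky if it lies far from the states seen during training."""

    def __init__(self, known_states: np.ndarray, threshold: float):
        # known_states: (N, d) array of states observed during training.
        # threshold: distance above which a state is treated as risky
        # (an assumed hyperparameter, e.g. tuned on held-out clean states).
        self.known_states = known_states
        self.threshold = threshold

    def risk(self, state: np.ndarray) -> float:
        # Risk = distance to the nearest known state; large values suggest
        # the input lies outside the agent's experience (e.g., because an
        # adversarial perturbation pushed it off the training manifold).
        dists = np.linalg.norm(self.known_states - state, axis=1)
        return float(dists.min())

    def is_adversarial(self, state: np.ndarray) -> bool:
        # Treat unusually distant states as potential adversarial inputs.
        return self.risk(state) > self.threshold


# Usage at deployment time: screen each observation before acting on it.
known = np.random.rand(10_000, 8)    # placeholder for states collected in training
detector = InstanceBasedRiskDetector(known, threshold=0.5)

observation = np.random.rand(8)      # placeholder incoming state
if detector.is_adversarial(observation):
    print("High risk: reject the input or fall back to a safe action.")
```

In practice, such a detector would operate on a learned feature embedding rather than raw pixels, and the nearest-neighbor search would use an approximate index for speed; the brute-force scan above is only for clarity.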
