Abstract

This paper presents a novel approach for adapting attackers' and defenders' preferred patrolling strategies using reinforcement learning (RL) based on average rewards in Stackelberg security games. We propose a framework that combines three paradigms: prior knowledge, imitation, and the temporal-difference method. The overall RL architecture involves two top-level components: the Adaptive Primary Learning architecture and the Actor–critic architecture. In this work we consider that defenders and attackers form coalitions in the Stackelberg security game; these are reached by computing the Strong Lp-Stackelberg/Nash equilibrium. We present a numerical example that validates the proposed RL approach by measuring the benefits for security resource allocation.
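As an illustrative aside, the average-reward setting mentioned above is typically handled with a differential temporal-difference update, in which values are learned relative to a running estimate of the average reward rather than with discounting. The sketch below is not the paper's algorithm; it is a minimal, generic differential TD(0) learner over hypothetical `(state, reward, next_state)` transitions, with assumed step sizes `alpha` and `beta`.

```python
def avg_reward_td(transitions, alpha=0.1, beta=0.01):
    """Minimal differential TD(0) for the average-reward setting.

    transitions: iterable of (state, reward, next_state) tuples.
    alpha: step size for the state-value estimates (assumed value).
    beta: step size for the average-reward estimate (assumed value).
    Returns the learned state values and the average-reward estimate.
    """
    V = {}        # state-value estimates, relative to the average reward
    r_bar = 0.0   # running estimate of the long-run average reward
    for s, r, s_next in transitions:
        # Differential TD error: reward minus average reward plus value change.
        delta = r - r_bar + V.get(s_next, 0.0) - V.get(s, 0.0)
        V[s] = V.get(s, 0.0) + alpha * delta
        r_bar += beta * delta
    return V, r_bar
```

On a simple two-state cycle that alternates rewards 1 and 0, the estimate `r_bar` settles near the true average reward of 0.5, which is the quantity an average-reward RL agent optimizes.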
