Low Resource-Reallocation Defense Strategies for Repeated Security Games With No Prior Knowledge and Limited Observability

Jin Zhu,Qiang Ling,Geir E Dullerud,Jinglong Zhang

doi:10.1109/tcds.2023.3241364

Jin Zhu, Qiang Ling + Show 2 more

https://doi.org/10.1109/tcds.2023.3241364

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

This paper takes into account general repeated security games with no prior knowledge, i.e., the game payoffs and the attacker’s behavior model are unknown, and limited observability. Besides the traditional “regret” criterion“, reallocation times” is introduced as an additional criterion that provides a more comprehensive evaluation of the defense strategies. For such games, a novel Random-Walk Perturbations with Uniform Exploration (RWP-UE) algorithm is proposed and we deduce the corresponding upper bound of the expected regret and expected reallocation times. Theoretical analysis shows that the RWP-UE algorithm achieves not only low regret with the same magnitude as existing achievements but also fewer reallocation times. Experiments are carried out against four types of attackers, and the results illustrate that the RWP-UE algorithm achieves superior performance.

Full Text