Learning in hide-and-seek

Qingsi Wang,Mingyan Liu

doi:10.1109/infocom.2014.6847942

Abstract

Existing work on pursuit-evasion problems typically either assumes stationary or heuristic behavior of one side and examines countermeasures of the other, or assumes both sides to be strategic which leads to a game theoretical framework. Results from the former may lack robustness against changes in the adversarial behavior, while those from the latter are often difficult to justify due to the implied full information (either as realizations or as distributions) and rationality, both of which may be limited in practice. In this paper, we take a different approach by assuming an intelligent pursuer/evader that is adaptive to the information available to it and is capable of learning over time with performance guarantee. Within this context we investigate two cases. In the first case we assume either the evader or the pursuer is aware of the type of learning algorithm used by the opponent, while in the second case neither side has such information and thus must try to learn. We show that the optimal policies in the first case have a greedy nature, hiding/seeking in the location that the opponent is the least/most likely to appear. This result is then used to assess the performance of the learning algorithms that both sides employ in the second case, which is shown to be mutually optimal and there is no loss for either side compared to the case when it completely knows the adaptive pattern used by the adversary and responses optimally.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning in hide-and-seek

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Learning in Hide-and-Seek
Qingsi Wang ... Mingyan Liu
IEEE/ACM Transactions on Networking | VOL. 24
Qingsi Wang, et. al.Qingsi Wang ... Mingyan Liu
01 Apr 2016
IEEE/ACM Transactions on Networking | VOL. 24

Static Routing in Stochastic Scheduling: Performance Guarantees and Asymptotic Optimality
Santiago R Balseiro ... David B Brown
Operations Research | VOL. 66
Santiago R Balseiro, et. al.Santiago R Balseiro ... David B Brown
01 Nov 2018
Operations Research | VOL. 66

Learning to Order for Inventory Systems with Lost Sales and Uncertain Supplies
Boxiao Chen ... Zhengyuan Zhou
Management Science | VOL. -
Boxiao Chen, et. al.Boxiao Chen ... Zhengyuan Zhou
04 Mar 2024
Management Science | VOL. -

Optimal admission control for high speed networks: a dynamic programming approach
T Jiminez
-
T JiminezT Jiminez
12 Dec 2000
12 Dec 2000

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning in hide-and-seek

Abstract

Talk to us

Similar Papers