에피소드 매개변수 최적화를 이용한 확률게임에서의 추적정책 성능 향상

Dong-Jun Kwak,H.-Jin Kim

doi:10.5139/jksas.2012.40.3.215

Abstract

In this paper, we introduce an optimization method to improve pursuit performance of a pursuer in a pursuit-evasion game (PEG). Pursuers build a probability map and employ a hybrid pursuit policy which combines the merits of local-max and global-max pursuit policies to search and capture evaders as soon as possible in a 2-dimensional space. We propose an episodic parameter optimization (EPO) algorithm to learn good values for the weighting parameters of a hybrid pursuit policy. The EPO algorithm is performed while many episodes of the PEG are run repeatedly and the reward of each episode is accumulated using reinforcement learning, and the candidate weighting parameter is selected in a way that maximizes the total averaged reward by using the golden section search method. We found the best pursuit policy in various situations which are the different number of evaders and the different size of spaces and analyzed results.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

에피소드 매개변수 최적화를 이용한 확률게임에서의 추적정책 성능 향상

Abstract

Talk to us

Similar Papers

More From: Journal of the Korean Society for Aeronautical & Space Sciences

Lead the way for us

Similar Papers

Policy Improvements for Probabilistic Pursuit-Evasion Game
Dong Jun Kwak ... H Jin Kim
Journal of Intelligent & Robotic Systems | VOL. 74
Dong Jun Kwak, et. al.Dong Jun Kwak ... H Jin Kim
11 Jul 2013
Journal of Intelligent & Robotic Systems | VOL. 74

Probabilistic pursuit-evasion games: theory, implementation, and experimental evaluation
R Vidal ... S Sastry
IEEE Transactions on Robotics and Automation | VOL. 18
R Vidal, et. al.R Vidal ... S Sastry
01 Oct 2002
IEEE Transactions on Robotics and Automation | VOL. 18

Police vehicular pursuits: a descriptive analysis of state agencies' written policy
Wendy L Hicks
Policing: An International Journal of Police Strategies & Management | VOL. 29
Wendy L HicksWendy L Hicks
01 Jan 2006
Policing: An International Journal of Police Strategies & Management | VOL. 29

Multi-agent Deep Reinforcement Learning for Pursuit-Evasion Game Scalability
Lin Xu ... Jiangwen Xiao
-
Lin Xu, et. al.Lin Xu ... Jiangwen Xiao
08 Sep 2019
08 Sep 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

에피소드 매개변수 최적화를 이용한 확률게임에서의 추적정책 성능 향상

Abstract

Talk to us

Similar Papers

More From: Journal of the Korean Society for Aeronautical &amp; Space Sciences

More From: Journal of the Korean Society for Aeronautical & Space Sciences