Perimeter Patrol Research Articles

Inverse reinforcement learning (IRL), analogously to RL, refers to both the problem and the associated methods by which an agent that passively observes another agent's actions over time seeks to learn the latter's reward function. The learning agent is typically called the learner, while the observed agent is often an expert in popular applications such as learning from demonstrations. Some of the assumptions that underlie current IRL methods are impractical for many robotic applications. Specifically, they assume that the learner has full observability of the expert as it performs its task; that the learner has full knowledge of the expert's dynamics; and that there is always only one expert agent in the environment. These assumptions are particularly restrictive in our application scenario, where a subject robot is tasked with penetrating a perimeter patrol by two other robots after observing them from a vantage point. In our instance of this problem, the learner can observe at most 10% of the patrol. We relax these assumptions and systematically generalize a known IRL method, Maximum Entropy IRL, to enable the subject to learn the preferences of the patrolling robots, and subsequently their behaviors, and to predict their future positions well enough to plan a route to its goal state without being spotted. Challenged by occlusion, multiple interacting robots, and partially known dynamics, we demonstrate empirically that the generalization improves significantly on several baselines in its ability to inversely learn in this application setting. Of note, it leads to significant improvement in the learner's overall success rate of penetrating the patrols. Our methods represent significant steps towards making IRL pragmatic and applicable to real-world contexts.
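For readers unfamiliar with Maximum Entropy IRL, the sketch below illustrates only the standard, fully observed form of the method, not the generalization to occlusion, multiple experts, or unknown dynamics that the abstract describes. It assumes a tiny problem in which all candidate trajectories can be enumerated; the feature vectors, demonstration averages, learning rate, and function names are hypothetical and chosen purely for illustration. Trajectories are weighted in proportion to exp(theta . f(tau)), and the reward weights theta are adjusted until the expected feature counts match those of the demonstrations.

```python
import math

def trajectory_probs(theta, traj_features):
    """MaxEnt distribution: P(tau) is proportional to exp(theta . f(tau))."""
    scores = [sum(t * f for t, f in zip(theta, feats)) for feats in traj_features]
    m = max(scores)  # subtract the max score for numerical stability
    weights = [math.exp(s - m) for s in scores]
    z = sum(weights)
    return [w / z for w in weights]

def expected_features(theta, traj_features):
    """Feature counts averaged under the current MaxEnt distribution."""
    probs = trajectory_probs(theta, traj_features)
    return [
        sum(p * feats[k] for p, feats in zip(probs, traj_features))
        for k in range(len(theta))
    ]

def fit_maxent(traj_features, demo_features, lr=0.5, steps=2000):
    """Gradient ascent on the demonstration log-likelihood.

    The gradient is (demonstrated feature counts - expected feature counts).
    """
    theta = [0.0] * len(demo_features)
    for _ in range(steps):
        exp_f = expected_features(theta, traj_features)
        theta = [t + lr * (d - e) for t, d, e in zip(theta, demo_features, exp_f)]
    return theta

# Hypothetical toy problem: three candidate trajectories, two features each.
TRAJ_FEATURES = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Hypothetical feature counts averaged over observed demonstrations.
DEMO_FEATURES = [0.8, 0.5]

theta = fit_maxent(TRAJ_FEATURES, DEMO_FEATURES)
matched = expected_features(theta, TRAJ_FEATURES)
print("learned weights:", theta)
print("expected features under learned weights:", matched)
```

In realistic settings the trajectory set cannot be enumerated, so the expected feature counts are usually computed by dynamic programming over states; the enumeration here only keeps the example short.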

Substantial automation will be needed to allow operators to control the large teams of robots envisioned for search and rescue, perimeter patrol, and a wide variety of military tasks. Both analysis and research point to navigation and path planning as prime candidates for automation. When operators are isolated from robot navigation, however, there may be a loss of situation awareness (SA) and difficulties in monitoring robots for failures or abnormal behavior. Operators' navigational strategies are quite complex and extremely changeable in foraging tasks in unknown environments, reflecting background knowledge and expectations about human and natural environments. These considerations are missing from automated path planning algorithms, leading to differences in search patterns and exploration biases between human and automatically generated paths. Effectively integrating automated path planning into multirobot systems would require demonstrating that (1) automated path planning performs as well as humans on measures such as area coverage and (2) use of automated path planning does not degrade performance of related human tasks such as finding and marking victims. In this paper we seek to compare the divergence between human manual control and autonomous path planning in an urban search and rescue (USAR) task, using fractal analysis to characterize the paths generated by the two methods. Area coverage and human contributions to mixed-initiative planning are compared with fully automated path planning. Finally, the impact of automated planning on related victim identification and marking tasks is compared for automated paths and paths generated by previous participants.
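The abstract does not say which fractal measure was used. As one common possibility, the sketch below estimates a box-counting dimension for a path given as a list of (x, y) points. The function names, the sample paths, and the box sizes are assumptions made for illustration. A straight path should give a dimension close to 1, and a path that covers an area densely should approach 2.

```python
import math

def box_count(points, box_size):
    """Number of grid cells of side box_size containing at least one path point."""
    return len({(math.floor(x / box_size), math.floor(y / box_size)) for x, y in points})

def box_counting_dimension(points, box_sizes):
    """Least-squares slope of log N(s) against log(1/s) over the given box sizes."""
    xs = [math.log(1.0 / s) for s in box_sizes]
    ys = [math.log(box_count(points, s)) for s in box_sizes]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den

# Hypothetical densely sampled straight path along the x-axis.
straight = [(i / 1000.0, 0.0) for i in range(1000)]
# Hypothetical path that visits every cell of a fine grid over the unit square.
covering = [(i / 100.0, j / 100.0) for i in range(100) for j in range(100)]

dim_line = box_counting_dimension(straight, [0.2, 0.1, 0.05, 0.025])
dim_area = box_counting_dimension(covering, [0.2, 0.1, 0.05])
print("straight path:", dim_line)
print("area-covering path:", dim_area)
```

The estimate depends on the range of box sizes and on how densely the path is sampled, so comparisons between human and automated paths would need the same settings for both.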

Articles published on Perimeter Patrol

Coordination of Robot Teams Over Long Distances: From Georgia Tech to Tokyo Tech and Back-An 11,000-km Multirobot Experiment

Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions

Lower Bounding Linear Program for the Perimeter Patrol Optimization Problem

Optimization of Perimeter Patrol Operations Using Unmanned Aerial Vehicles

Adaptive Immune System TH1/TH2 Differentiation Mechanism Inspired Perimeter Patrol Control Strategy

Approximate dynamic programming with state aggregation applied to UAV perimeter patrol

Human vs. Algorithmic Path Planning for Search and Rescue by Robot Teams

Security PIDS with physical sensors, real-time pattern recognition, and continuous patrol
