Reinforcement Learning for Zone Based Multiagent Pathfinding under Uncertainty

Jiajing Ling,Tarun Gupta,Akshat Kumar

doi:10.1609/icaps.v30i1.6751

Abstract

We address the problem of multiple agents finding their paths from respective sources to destination nodes in a graph (also called MAPF). Most existing approaches assume that all agents move at fixed speed, and that a single node accommodates only a single agent. Motivated by the emerging applications of autonomous vehicles such as drone traffic management, we present zone-based path finding (or ZBPF) where agents move among zones, and agents' movements require uncertain travel time. Furthermore, each zone can accommodate multiple agents (as per its capacity). We also develop a simulator for ZBPF which provides a clean interface from the simulation environment to learning algorithms. We develop a novel formulation of the ZBPF problem using difference-of-convex functions (DC) programming. The resulting approach can be used for policy learning using samples from the simulator. We also present a multiagent credit assignment scheme that helps our learning approach converge faster. Empirical results in a number of 2D and 3D instances show that our approach can effectively minimize congestion in zones, while ensuring agents reach their final destinations.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the International Conference on Automated Planning and Scheduling	Publication Date: Jun 1, 2020
Citations: 2	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Reinforcement Learning for Zone Based Multiagent Pathfinding under Uncertainty

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling

Lead the way for us

Similar Papers

Multiple agents in biological control: improving the odds?
Madlen Denoth ... Judith H Myers
Biological Control | VOL. 24
Madlen Denoth, et. al.Madlen Denoth ... Judith H Myers
01 May 2002
Biological Control | VOL. 24

How do we judge what causes cancer? the meat controversy.
Paolo Vineis ... Bernard W Stewart
International Journal of Cancer | VOL. 138
Paolo Vineis, et. al.Paolo Vineis ... Bernard W Stewart
25 Feb 2016
International Journal of Cancer | VOL. 138

Group variable selection via [formula omitted] regularization and application to optimal scoring
Duy Nhat Phan ... Hoai An Le Thi
Neural Networks | VOL. 118
Duy Nhat Phan, et. al.Duy Nhat Phan ... Hoai An Le Thi
04 Jul 2019
Neural Networks | VOL. 118

Quasi‐oppositional wild horse optimization based multi‐agent path finding scheme for real time IoT systems
Radwa Marzouk ... Manar Ahmed Hamza
Expert Systems | VOL. 39
Radwa Marzouk, et. al.Radwa Marzouk ... Manar Ahmed Hamza
01 Aug 2022
Expert Systems | VOL. 39

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reinforcement Learning for Zone Based Multiagent Pathfinding under Uncertainty

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International Conference on Automated Planning and Scheduling