Abstract
We present an end-to-end differentiable learning algorithm for multi-agent navigation policies. Compared with prior model-free learning algorithms, our method achieves a significant speedup by exploiting gradient information. Our key innovation lies in a novel differentiability analysis of the optimization-based crowd simulation algorithm via the implicit function theorem. Inspired by continuum multi-agent modeling techniques, we further propose a kernel-based policy parameterization, allowing our learned policy to scale up to an arbitrary number of agents without re-training. We evaluate our algorithm on two tasks in obstacle-rich environments, partially labeled navigation and evacuation, for which loss functions can be defined, making the entire task learnable in an end-to-end manner. The results show that our method achieves more than one order of magnitude speedup over model-free baselines and readily scales to unseen target configurations and agent counts.
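The abstract's key technical claim is that gradients can be propagated through an optimization-based simulator using the implicit function theorem: if the next agent state x* minimizes a per-step energy E(x; u) given a control u, then at the optimum ∂E/∂x = 0, and differentiating this condition gives dx*/du = -(∂²E/∂x²)⁻¹ ∂²E/∂x∂u. Below is a minimal JAX sketch of this idea, assuming a hypothetical quadratic-plus-repulsion energy; the function names (energy, simulate_step) and the energy terms are illustrative placeholders, not the paper's actual formulation.

```python
import jax
import jax.numpy as jnp

def energy(x, x_prev, u, dt=0.1):
    # Illustrative per-step energy: agents try to follow the control u
    # while a soft pairwise repulsion keeps them apart.
    inertia = 0.5 * jnp.sum((x - x_prev - dt * u) ** 2)
    diffs = x[:, None, :] - x[None, :, :]
    dists2 = jnp.sum(diffs ** 2, axis=-1) + jnp.eye(x.shape[0])
    repulsion = jnp.sum(1.0 / dists2) - x.shape[0]  # drop diagonal terms
    return inertia + 1e-3 * repulsion

def simulate_step(x_prev, u, iters=200, lr=0.05):
    # Inner solver: gradient descent to approximate x* = argmin_x E(x; x_prev, u).
    g = jax.grad(energy, argnums=0)
    x = x_prev
    for _ in range(iters):
        x = x - lr * g(x, x_prev, u)
    return x

def implicit_step_grad(x_star, x_prev, u):
    # Implicit function theorem at the optimum (where dE/dx = 0):
    #   dx*/du = -(d^2E/dx^2)^{-1} d^2E/(dx du)
    n, d = x_star.shape
    flat_grad = lambda xf, uf: jax.grad(energy, argnums=0)(
        xf.reshape(n, d), x_prev, uf.reshape(n, d)).ravel()
    H = jax.jacobian(flat_grad, argnums=0)(x_star.ravel(), u.ravel())  # Hessian
    B = jax.jacobian(flat_grad, argnums=1)(x_star.ravel(), u.ravel())  # mixed partials
    return -jnp.linalg.solve(H, B)  # sensitivity of x* w.r.t. the control u

x_prev = jnp.array([[0.0, 0.0], [1.0, 0.0]])
u = jnp.array([[1.0, 0.0], [-1.0, 0.0]])
x_star = simulate_step(x_prev, u)
print(implicit_step_grad(x_star, x_prev, u).shape)  # (4, 4)
```

The practical benefit sketched here is that the sensitivity matrix costs one linear solve per step, rather than backpropagating through every inner-solver iteration, which is what enables end-to-end policy gradients through the simulator.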