Abstract

Safe and efficient collision avoidance is challenging for multiple robots of different shapes in distributed, communication-free scenarios, where robots do not communicate with each other and sense only the positions of nearby robots and obstacles. Most existing multi-robot collision avoidance systems either require communication between robots or require expensive movement data of other robots, such as velocities, accelerations and paths. In this paper, we propose a map-based deep reinforcement learning approach for multi-robot collision avoidance in a distributed and communication-free environment. We use the egocentric local grid map of a robot to represent the environmental information around it, including its own shape and the observable appearances of other robots and obstacles, which can be easily generated from multiple sensors or sensor fusion. We then apply the distributed proximal policy optimization (DPPO) algorithm to train a convolutional neural network that directly maps three frames of egocentric local grid maps and the robot's relative local goal positions into low-level robot control commands. Compared to other methods, the map-based approach is more robust to noisy sensor data, does not require the movement data of other robots, and accounts for the sizes and shapes of the robots involved, which makes it more efficient and easier to deploy on real robots. We first train the neural network in a dedicated multi-robot simulator using DPPO, where a multi-stage curriculum learning strategy over multiple scenarios is used to improve performance. We then deploy the trained model to real robots to perform collision avoidance during navigation without tedious parameter tuning. We evaluate the approach in multiple scenarios, both in the simulator and on four differential-drive mobile robots in the real world. Both qualitative and quantitative experiments show that our approach is efficient and outperforms existing DRL-based approaches on many metrics. We also conduct ablation studies showing the positive effects of using egocentric grid maps and multi-stage curriculum learning.
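To make the input-output mapping concrete, below is a minimal sketch of such a policy network. This is not the authors' reported architecture; the layer sizes, activations, and the `GridMapPolicy` name are all assumptions, illustrating only the stated interface of three stacked egocentric grid-map frames plus a relative local goal mapped to low-level velocity commands:

```python
import torch
import torch.nn as nn

class GridMapPolicy(nn.Module):
    """Hypothetical CNN policy: 3 stacked egocentric grid maps + relative
    goal -> (linear velocity, angular velocity). Layer sizes are assumptions,
    not the architecture reported in the paper."""

    def __init__(self, map_size: int = 60):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=5, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        with torch.no_grad():  # infer the flattened feature size
            n_flat = self.conv(torch.zeros(1, 3, map_size, map_size)).shape[1]
        self.head = nn.Sequential(
            nn.Linear(n_flat + 2, 256), nn.ReLU(),  # +2 for the (x, y) local goal
            nn.Linear(256, 2),                      # mean of (v, w) commands
        )

    def forward(self, maps: torch.Tensor, goal: torch.Tensor) -> torch.Tensor:
        # maps: (B, 3, H, W) occupancy grids; goal: (B, 2) goal in robot frame
        features = self.conv(maps)
        return self.head(torch.cat([features, goal], dim=1))
```

In an actor-critic setup such as DPPO, this network would serve as the policy (actor) head, with a value head trained alongside it on the same convolutional features.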

Highlights

  • With the rapid development of autonomous mobile robots in recent years, increasing attention has been paid to multi-robot collision avoidance, which is crucial in many applications, such as multi-robot search and rescue [1], multi-robot intelligent warehouse systems [2], autonomous navigation through human crowds [3] and autonomous driving [4]

  • Unlike sensor-level [43] methods, we use the egocentric local grid map of a robot to represent the environmental information around it, including its shape and the observable appearances of other robots and obstacles, which can be generated from multiple sensors or sensor fusion

  • We propose a map-based deep reinforcement learning (DRL) multi-robot collision avoidance approach for communication-free environments, where egocentric local grid maps represent the environmental information around the robot and can be generated from multiple sensors or sensor fusion


Summary

Introduction

With the rapid development of autonomous mobile robots in recent years, increasing attention has been paid to multi-robot collision avoidance, which is crucial in many applications, such as multi-robot search and rescue [1], multi-robot intelligent warehouse systems [2], autonomous navigation through human crowds [3] and autonomous driving [4]. Inspired by VO-based approaches, Chen et al. [40] propose a DRL-based method to train an agent-level collision avoidance policy, where the network still requires the expensive movement data of the ego robot, its neighbors and moving obstacles as its inputs. In their extension [41], multiple perception tasks, such as segmentation, recognition and tracking, are performed on multiple sensors to estimate the movement data of nearby robots and moving obstacles. In contrast, we train our collision avoidance policy in multiple simulation environments using DPPO and deploy it to real robots without tedious parameter tuning; the network takes egocentric local grid maps as inputs and directly outputs low-level robot control commands.
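The egocentric local grid map is described as being easy to generate from multiple sensors or sensor fusion. As a rough illustration only (an assumption, not the paper's pipeline, which also encodes the robot's own shape and may fuse several sensors), a single 2D lidar scan can be rasterized into a robot-centered occupancy grid like this:

```python
import numpy as np

def scan_to_egocentric_grid(ranges: np.ndarray, angles: np.ndarray,
                            size: int = 60, resolution: float = 0.1) -> np.ndarray:
    """Rasterize a 2D lidar scan into a robot-centered occupancy grid.
    Hypothetical helper; the paper's actual map generation may differ.

    ranges/angles: polar scan in the robot frame (meters, radians).
    size: grid side length in cells; resolution: meters per cell.
    """
    grid = np.zeros((size, size), dtype=np.float32)
    # Convert polar returns to Cartesian points in the robot frame.
    xs = ranges * np.cos(angles)
    ys = ranges * np.sin(angles)
    # Shift so the robot sits at the grid center, then index into cells.
    cols = np.floor(xs / resolution).astype(int) + size // 2
    rows = np.floor(ys / resolution).astype(int) + size // 2
    valid = (rows >= 0) & (rows < size) & (cols >= 0) & (cols < size)
    grid[rows[valid], cols[valid]] = 1.0  # mark occupied cells
    return grid
```

Stacking three consecutive grids of this kind would then give the policy network a short temporal history from which relative motion of neighbors can be implicitly inferred, without explicit velocity estimation.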

Problem Formulation
Approach
Reinforcement Learning Components
Observation Space
Action Space
Reward Function
Distributed Proximal Policy Optimization
Training Process
Network Architecture
Multi-Stage Curriculum Learning
Simulation Experiments
Implementation Details
Generalization Capability
Large-Scale Scenarios
Heterogeneous Robots
Metrics and Scenarios
Quantitative Results
Robustness Evaluation
Different Sensor Noise
Different FOV Limits
Different Sensor Types
Real-World Experiments
Hardware Setup
Static and Dynamic Scenarios
Multi-Robot Scenarios
Long-Range Navigation
Conclusions