Abstract

This paper presents a deep-reinforcement-learning approach to the crowd navigation problem in an unknown and dynamic environment. In the first stage, four leader agents learn to reach their goals while avoiding collisions with static and dynamic obstacles in an unknown environment, using Proximal Policy Optimization (PPO) combined with Long Short-Term Memory (LSTM) and a collision prediction algorithm. In the second stage, each leader agent travels to a specific goal several times, and its trajectory is recorded as a guiding path that tells the members of its group how to reach their goals. We adopt the Reciprocal Velocity Obstacle (RVO) algorithm to prevent collisions between agents. Finally, we simulate four groups moving toward their goals simultaneously in the Unity 3D engine. The experimental results demonstrate the self-learning ability of a crowd that reaches its goals successfully in an unknown and dynamic environment.
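The second stage described above amounts to recording a trained leader's positions and replaying them as waypoints for group members. The following is a minimal sketch of that idea, assuming a 2D environment; the class and parameter names (GuidingPath, waypoint_radius) are illustrative, not taken from the paper.

```python
import math

class GuidingPath:
    """Record a leader's trajectory and serve it as waypoints to group members."""

    def __init__(self, waypoint_radius=0.5):
        self.waypoints = []                  # recorded leader positions (x, y)
        self.waypoint_radius = waypoint_radius

    def record(self, leader_pos):
        """Append the leader's current position while it walks to its goal."""
        self.waypoints.append(leader_pos)

    def next_waypoint(self, member_pos, index):
        """Advance a member's waypoint index once it is close enough."""
        x, y = member_pos
        wx, wy = self.waypoints[index]
        if math.hypot(wx - x, wy - y) < self.waypoint_radius:
            index = min(index + 1, len(self.waypoints) - 1)
        return index
```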

Highlights

  • Crowd simulation has been gaining considerable attention due to its applications in entertainment, education, architecture, training, urban engineering and virtual heritage

  • We adopt the Reciprocal Velocity Obstacle (RVO) algorithm to prevent agents from colliding with one another, and we simulate the scenario of four groups moving towards their goals simultaneously

  • Our method is inspired by deep reinforcement learning, which overcomes the limitations of Q-learning methods (a discrete action space and intractability in high-dimensional state spaces) and finds feasible, collision-free paths for crowds among static and dynamic obstacles by assigning rewards and penalties (see the sketch after this list)
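The last highlight relies on reward shaping. The paper does not list its exact reward terms, so the following sketch uses assumed terms and weights: a bonus for reaching the goal, a penalty for collisions, and a small progress-based shaping term.

```python
def reward(reached_goal, collided, prev_dist, curr_dist):
    """Assumed reward shaping for goal-directed, collision-avoiding navigation."""
    if reached_goal:
        return 1.0        # bonus for arriving at the goal
    if collided:
        return -1.0       # penalty for hitting an obstacle or another agent
    # shaping: reward progress toward the goal, penalize each extra step
    return 0.1 * (prev_dist - curr_dist) - 0.01
```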


Summary

INTRODUCTION

Crowd simulation has been gaining considerable attention due to its applications in entertainment, education, architecture, training, urban engineering and virtual heritage. Path planning and decision making ensure that agents reach their goals in an optimal way without colliding with obstacles or other agents; they are therefore central aspects of crowd simulation that deserve great research effort. We first make four leader agents learn how to reach their goals and avoid collisions with static and dynamic obstacles in an unknown environment, using Proximal Policy Optimization (PPO) [35] combined with Long Short-Term Memory (LSTM) [34] and a collision prediction algorithm [38]. The agents learn to avoid collisions with static obstacles in an unknown environment even when the positions of those obstacles change, which shows that they acquire a human-like self-learning ability.
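The following is a minimal sketch of a recurrent policy of the kind the introduction describes: an LSTM over recent observations feeding a PPO actor-critic head with the standard clipped surrogate loss. PyTorch, the layer sizes, and the observation/action dimensions are assumptions for illustration; the paper's exact architecture is not reproduced here.

```python
import torch
import torch.nn as nn

class RecurrentActorCritic(nn.Module):
    """LSTM-based actor-critic, as used with PPO for partially observed navigation."""

    def __init__(self, obs_dim=40, action_dim=2, hidden=128):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(obs_dim, hidden), nn.ReLU())
        self.lstm = nn.LSTM(hidden, hidden, batch_first=True)
        self.policy_mean = nn.Linear(hidden, action_dim)  # mean of continuous action
        self.log_std = nn.Parameter(torch.zeros(action_dim))
        self.value = nn.Linear(hidden, 1)                 # state value for PPO

    def forward(self, obs_seq, hidden_state=None):
        # obs_seq: (batch, time, obs_dim); LSTM keeps memory of past observations
        x = self.encoder(obs_seq)
        x, hidden_state = self.lstm(x, hidden_state)
        dist = torch.distributions.Normal(self.policy_mean(x), self.log_std.exp())
        return dist, self.value(x), hidden_state

def ppo_clip_loss(dist, actions, old_log_probs, advantages, clip_eps=0.2):
    """PPO clipped surrogate objective (returned as a loss to be minimized)."""
    log_probs = dist.log_prob(actions).sum(-1)
    ratio = torch.exp(log_probs - old_log_probs)
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps)
    return -torch.min(ratio * advantages, clipped * advantages).mean()
```

The LSTM lets the policy act on a short history of observations rather than a single frame, which is why the introduction pairs it with PPO for navigation among moving obstacles.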

RELATED WORKS
POLICY REPRESENTATION
RESULTS
CONCLUSION
