A deep multi-agent reinforcement learning framework for autonomous aerial navigation to grasping points on loads

Jingyu Chen,Ruidong Ma,John Oyekan

doi:10.1016/j.robot.2023.104489

Abstract

Deep reinforcement learning, by taking advantage of neural networks, has made great strides in the continuous control of robots. However, in scenarios where multiple robots are required to collaborate with each other to accomplish a task, it is still challenging to build an efficient and scalable multi-agent control system due to increasing complexity. In this paper, we regard each unmanned aerial vehicle (UAV) with its manipulator as one agent, and leverage the power of multi-agent deep deterministic policy gradient (MADDPG) for the cooperative navigation and manipulation of a load. We propose solutions for addressing navigation to grasping point problem in targeted and flexible scenarios, and mainly focus on how to develop model-free policies for the UAVs without relying on a trajectory planner. To overcome the challenges of learning in scenarios with an increasing number of grasping points, we incorporate the demonstrations from an Optimal Reciprocal Collision Avoidance (ORCA) algorithm into our framework to guide the policy training and adapt two novel techniques into the architecture of MADDPG. Furthermore, curriculum learning with the attention mechanism is utilized by reusing knowledge from fewer grasping points to facilitate the training of a load with more points. Our experiments were validated by a load with three, four and six grasping points respectively in Coppeliasim simulator and then transferred into the real world with Crazyflie quadrotors. Our results show that the average tracking deviations from the desirable grasping point to the final position of the UAV can be less than 10 cm in some real-world experiments. Compared with state-of-the-art model-free reinforcement learning and swarm optimization algorithms, results show that our proposed methods outperform other baselines with a reasonable success rate especially in the scenarios with more grasping points. Furthermore, the learned optimal policies enable UAVs to reach and hover over all the grasping points before manipulation without any collision. We conducted a comprehensive analysis of both targeted and flexible navigation, highlighting their respective advantages and disadvantages.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Robotics and Autonomous Systems	Publication Date: Jul 10, 2023
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

A deep multi-agent reinforcement learning framework for autonomous aerial navigation to grasping points on loads

Abstract

Talk to us

Similar Papers

More From: Robotics and Autonomous Systems

Lead the way for us

Similar Papers

Building a Connected Communication Network for UAV Clusters Using DE-MADDPG
Zixiong Zhu ... Nianhao Xie
Symmetry | VOL. 13
Zixiong Zhu, et. al.Zixiong Zhu ... Nianhao Xie
20 Aug 2021
Symmetry | VOL. 13

Game Combined Multi-Agent Reinforcement Learning Approach for UAV Assisted Offloading
Ang Gao ... Wei Liang
IEEE Transactions on Vehicular Technology | VOL. 70
Ang Gao, et. al.Ang Gao ... Wei Liang
01 Dec 2021
IEEE Transactions on Vehicular Technology | VOL. 70

Three-Dimensional Trajectory and Resource Allocation Optimization in Multi-Unmanned Aerial Vehicle Multicast System: A Multi-Agent Reinforcement Learning Method
Dongyu Wang ... Hongda Yu
Drones | VOL. 7
Dongyu Wang, et. al.Dongyu Wang ... Hongda Yu
19 Oct 2023
Drones | VOL. 7

Computation Offloading and Resource Allocation Based on Multi-agent Federated Learning
Yiming Yao ... Zheyuan Hu
-
Yiming Yao, et. al.Yiming Yao ... Zheyuan Hu
01 Jan 2021
01 Jan 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A deep multi-agent reinforcement learning framework for autonomous aerial navigation to grasping points on loads

Abstract

Talk to us

Similar Papers

More From: Robotics and Autonomous Systems