Abstract

The formation of multiple Unmanned Surface Vehicles (USVs) is an effective way to extend the capabilities of a single USV to relatively complex practical tasks. In this study, we propose a formation strategy for multiple USVs based on the deep reinforcement learning method Multi-Agent Deep Deterministic Policy Gradient (MADDPG). Line of Sight (LOS) guidance is integrated into the formation strategy under a leader-follower scheme. Because the strategy does not rely on a dynamic model of the USV, it has strong potential to transfer to other multi-agent systems. To evaluate multi-USV formation performance, we designed two scenarios that reflect practical multi-USV tasks: observation aperture enhancement with a desired formation, and roundup of a dynamic non-cooperative target. The proposed strategy was demonstrated in both a simulation environment and a real-world environment. Compared with other deep reinforcement learning-inspired approaches and traditional approaches, the MADDPG-based strategy achieved a higher task success rate. It also outperformed Deep Deterministic Policy Gradient (DDPG) on other metrics, because centralized training and the observation of joint information allow it to learn more effectively from dynamic environments.
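To illustrate how LOS guidance can combine with a leader-follower scheme, the following is a minimal sketch and not the authors' implementation. It assumes a planar model, a follower slot defined by fixed offsets in the leader's body frame, and the simplest form of LOS guidance, in which the desired heading points directly at the target point. All function and parameter names (`follower_target`, `los_heading`, `heading_error`, `offset_back`, `offset_left`) are hypothetical.

```python
import math


def follower_target(leader_x, leader_y, leader_heading, offset_back, offset_left):
    """Assumed formation slot: a point offset_back behind and offset_left to the
    left of the leader, expressed in the world frame (heading in radians)."""
    tx = leader_x - offset_back * math.cos(leader_heading) - offset_left * math.sin(leader_heading)
    ty = leader_y - offset_back * math.sin(leader_heading) + offset_left * math.cos(leader_heading)
    return tx, ty


def los_heading(x, y, target_x, target_y):
    """Simplest LOS guidance: desired heading along the line from the vehicle to the target."""
    return math.atan2(target_y - y, target_x - x)


def heading_error(desired, current):
    """Signed heading error wrapped to (-pi, pi]."""
    diff = desired - current
    return math.atan2(math.sin(diff), math.cos(diff))


# Usage: leader at the origin heading along +x; follower slot 2 m behind, 1 m to the left.
tx, ty = follower_target(0.0, 0.0, 0.0, 2.0, 1.0)
desired = los_heading(-2.0, 0.0, tx, ty)
error = heading_error(desired, 0.0)
```

In a learning-based setup such as the one described, quantities like the heading error and the distance to the formation slot would plausibly enter the agent's observation or reward, but that mapping is an assumption here and is not specified in the abstract.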
