Abstract

In maritime search and rescue (SAR), the planning of the search path will directly affect the efficiency of searching for people overboard in the search area. However, traditional SAR decision-making schemes often adopt a fixed search path planning mode, but the limits are poor flexibility, low efficiency, and insufficient intelligence. This paper plans a search path with the shortest time-consuming and priority coverage of high-probability areas, considering complete coverage of maritime SAR areas and avoiding maritime obstacles. Firstly, a maritime SAR environment model is built using marine environmental field data and electronic charts. Secondly, an autonomous coverage path planning model for maritime SAR is proposed based on reinforcement learning, in which a reward function with multiple constraints is designed to guide the navigation action of the vessel agent. In the iterative training process of the path planning model, the random action selection probability is dynamically adjusted by the nonlinear action selection policy to ensure the stable convergence of the model. Finally, the experimental verification is conducted in different small-scale maritime SAR simulation scenarios. The results indicate that the search path can cover the high-probability areas preferentially with lower repeated coverage and shorter path length compared with other path planning algorithms.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call