Abstract Target tracking based on sonar images plays a crucial role in underwater coordinated operations and strikes. In this paper, aiming to address the problem of underwater noise interference and target boundary blurring in sonar image, we propose an enhanced SiamMask network, SiamMask-RAM, for underwater target tracking. By combining hybrid attention and cross correlation structures, a new similarity metric branch is proposed to enhance perception of underwater target boundaries. Furthermore, by combining the confidence scores of the positive samples with the IoU, a ranking loss optimization strategy is proposed to reduce the possibility of mismatch between the classification and regression. Exceptionally, we constructed an underwater typical target dataset based on sonar images. The evaluation results on the sonar dataset demonstrate that SiamMask-RAM achieves a tracking accuracy of 0.752, an expected average overlap of 0.362, which are 22.6% and 21.9% better than the SiamRPN tracking algorithm, respectively. These findings indicate that the proposed method exhibits high accuracy and robustness in underwater tracking scenarios involving sonar images.
Read full abstract