Abstract

Using satellite-based SAR (Synthetic Aperture Radar) imagery to detect and track ships is a formidable challenge. However, accurate analysis is hampered by inherent difficulties such as obscured edges, multiple targets, varying dimensions and complex backgrounds. These factors contribute to a poor signal-to-noise ratio. Various artificial intelligence models have been developed to improve these problems, especially with YOLO-based models. To overcome the above challenges and achieve robust performance, this study presents a model called SwinYOLOv7, a novel fusion of the YOLOv7 framework with the Features Pyramid Network and the Swin Transformer using ground-breaking anchor-free detection algorithms. This innovative method aims to increase the accuracy of vessel detection while reducing the impact of background clutter. The proposed model improved by YOLOv7 examined three different datasets in detail: SRSDD-v1.0, HRSID, and SSDD to optimize the performance of the model. The training process was consistently verified using superior recall, precision, and F1-score values, which can be easily compared with previous studies. The results show that the model using the Swin Transformer attention mechanism and using an image size of 640×640 achieves the highest accuracy of 96.59%. Alternative attention mechanisms, including the Squeeze-and-Excitation Network (SEnet), the Convolutional Block Attention Module (CBAM), Channel Attention (CA) and Efficient Channel Attention (ECA), deliver poorer accuracy rates. The combination of YOLOv7 and Swin Transformer yielded encouraging results that enabled the proposed model to outperform the current benchmarking models. Therefore, the proposed model provides a compelling solution to ensure accurate vessel identification in complex search and rescue scenarios.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call