Due to the complexity and diversity of underwater environments, traditional object detection algorithms face challenges in maintaining robustness and detection accuracy when applied underwater. This paper proposes an underwater object detection algorithm based on an improved YOLOv8 model. First, the introduction of CIB building blocks into the backbone network, along with the optimization of the C2f structure and the incorporation of large-kernel depthwise convolutions, effectively enhances the model’s receptive field. This improvement increases the capability of detecting multi-scale objects in complex underwater environments without adding a computational burden. Next, the incorporation of a Partial Self-Attention (PSA) module at the end of the backbone network enhances model efficiency and optimizes the utilization of computational resources while maintaining high performance. Finally, the integration of the Neck component from the Gold-YOLO model improves the neck structure of the YOLOv8 model, facilitating the fusion and distribution of information across different levels, thereby achieving more efficient information integration and interaction. Experimental results show that YOLOv8-CPG significantly outperforms the traditional YOLOv8 in underwater environments. Precision and Recall show improvements of 2.76% and 2.06%. Additionally, mAP50 and mAP50-95 metrics have increased by 1.05% and 3.55%, respectively. Our approach provides an efficient solution to the difficulties encountered in underwater object detection.
Read full abstract