To address the problems of missed and false detections caused by target occlusion and lighting variations, this paper proposes a recognition model based on YOLOv5 with bi-level routing attention to achieve precise real-time small object recognition, using the problem of tailhook recognition for carrier-based aircraft as a representative application. Firstly, a module called D_C3, which combines deformable convolution, was integrated into the backbone network to enhance the model’s learning ability and adaptability in specific scenes. Secondly, a bi-level routing attention mechanism was employed to dynamically focus on the regions of the feature map that are more likely to contain the target, leading to more accurate target localization and classification. Additionally, the loss function was optimized to accelerate the bounding box regression process. The experimental results on the self-constructed CATHR-DET and the public VOC dataset demonstrate that the proposed method outperforms the baselines in overall performance.
Read full abstract