Abstract In complex traffic sign environments, detection challenges include uneven size distribution, insufficient valid information, and difficulties in identifying targets under resource constraints, leading to missed and false detections. This study proposes an enhanced lightweight traffic sign detection algorithm based on SSD (Single Shot MultiBox Detector). By replacing the original backbone network with MobileNetV2, the model is streamlined to have fewer parameters, which improves generalization in complex environments. This modification significantly boosts the recall rate, achieving a better balance between precision and recall. By introducing the FPN(Feature Pyramid Network) combined with the CBAM(Convolutional Block Attention Module) attention mechanism, the detailed and semantic information between deep and shallow layers is fully integrated, reducing the loss of feature information, thus enhancing the strengthening of key information of traffic signs and the adaptability to different scales of traffic signs. Finally, by integrating the cross-attention mechanism, the algorithm's anti-interference ability in complex environments is improved, and the positioning accuracy of traffic signs is enhanced by capturing the dependency between different positions. Through ablation experiments and comparative experiments on a public traffic sign dataset, our improved SSD algorithm achieved an mAP of 89.97%. Compared with the original algorithm, the mAP increased by 12.41%, the recall rate increased by 18.38%, and the sum of precision and recall F1 increased by 14.6%. These improvements significantly enhance the performance of traffic sign detection in complex environments, thereby meeting the performance requirements of traffic sign detection.
Read full abstract