Abstract The development of traffic sign recognition (TSR) has become increasingly important for enhancing the safety and convenience of assisted driving. To achieve high accuracy, faster inference speed, and a lightweight model, an improved lightweight TSR network, termed YOLOv8-ALWP, has been proposed. This network incorporates adaptive downsampling to replace the original convolution module in YOLOv8. By employing multiple pooling and convolution operations, it reduces the spatial resolution to extract additional feature information. To accommodate the varying scale characteristics of different traffic signs, large separable kernel attention is introduced to enhance spatial pyramid pooling-fast. Furthermore, the complete intersection over union loss has been improved, and a new Wise-Focaler-EIoU Loss has been proposed to accelerate model convergence and enhance generalization capabilities. Finally, layer-adaptive sparsity for magnitude-based pruning is employed to reduce the model’s parameters, decrease computational complexity, and improve inference speed. Experiments were conducted using the TT100K, Roadsign, CCTSDB, and GTSDB datasets. In the TT100K dataset, compared to the baseline model, the improved algorithm significantly reduced parameters by 64.67%, FLOPs by 44.44%, and increased mAP by 1.7%, precision by 5.5%, and FPS from 70.3 to 81.7, respectively. Under four specific conditions, the improved algorithm effectively addressed the shortcomings of the baseline model, such as missed detections and reduced accuracy. These experimental results indicate that the YOLOv8-ALWP algorithm achieves model lightweighting while enhancing detection accuracy.
Read full abstract