In the realm of autonomous driving, practical driving scenarios are fraught with numerous complexities, including inclement weather conditions, nighttime blurriness, and ambient light sources that significantly hinder drivers’ ability to discern road indicators. Furthermore, the dynamic nature of road indicators, which are constantly evolving, poses additional challenges for computer vision-based detection systems. To address these issues, this paper introduces a road indicator light detection model, leveraging the advanced capabilities of YOLOv8. We have ingeniously integrated the robust backbone of YOLOv8 with four distinct attention mechanism modules—Convolutional Block Attention Module (CBAM), Efficient Channel Attention (ECA), Shuffle Attention (SA), and Global Attention Mechanism (GAM)—to significantly enhance the model performance in capturing nuanced features of road indicators and boosting the accuracy of detecting minute objects. Additionally, we employ the Asymptotic Feature Pyramid Network (AFPN) strategy, which optimizes the fusion of features across different scales, ensuring not only an enhanced performance but also maintaining real-time capability. These innovative attention modules empower the model by recalibrating the significance of both channel and spatial dimensions within the feature maps, enabling it to hone in on the most pertinent object characteristics. To tackle the challenges posed by samples rich in small, occluded, background-similar objects, and those that are inherently difficult to recognize, we have incorporated the Focaler-IOU loss function. This loss function deftly reduces the contribution of easily detectable samples to the overall loss, thereby intensifying the focus on challenging samples. This strategic balancing of hard-to-detect versus easy-to-detect samples effectively elevates the model’s detection performance. Experimental evaluations conducted on both a public traffic signal dataset and a proprietary headlight dataset have yielded impressive results, with both mAP50 and mAP50:95 metrics experiencing significant improvements exceeding two percentage points. Notably, the enhancements observed in the headlight dataset are particularly profound, signifying a significant step forward toward realizing safer and more reliable assisted driving technologies.
Read full abstract