Abstract

Vision-based traffic sign detection plays a crucial role in intelligent transportation systems. Recently, many approaches based on deep learning for traffic sign detection have been proposed and showed better performance compared with traditional approaches. However, due to difficult conditions in driving environment and the size of traffic signs in traffic scene images, the performance of deep learning-based methods on small traffic sign detection is still limited. In addition, the inference speed of current state-of-the-art approaches on traffic sign detection is still slow. This paper proposes a deep learning-based approach to improve the performance of small traffic sign detection in driving environments. First, a lightweight and efficient architecture is adopted as the base network to address the issue of the inference speed. To enhance the performance on small traffic sign detection, a deconvolution module is adopted to generate an enhanced feature map by aggregating a lower-level feature map with a higher-level feature map. Then, two improved region proposal networks are used to generate proposals from the highest-level feature map and the enhanced feature map. The proposed improved region proposal network is designed for fast and accuracy proposal generation. In the experiments, the German Traffic Sign Detection Benchmark dataset is used to evaluate the effectiveness of each enhanced module, and the Tsinghua-Tencent 100K dataset is used to compare the effectiveness of the proposed approach with other state-of-the-art approaches on traffic sign detection. Experimental results on Tsinghua-Tencent 100K dataset show that the proposed approach achieves competitive performance compared with current state-of-the-art approaches on traffic sign detection while being faster and simpler.

Highlights

  • Vision-based traffic sign recognition plays an essential role in intelligent transport systems such as an automated driving system and an advanced driver assistance system

  • The improved region proposal network includes a 1 × 1 convolution layer to reduce the number of parameters in the subsequent convolutional layers and a 3 × 3 dilated convolution to enlarge the receptive field, improving the detection accuracy and the inference speed of the proposal generation stage

  • In order to compare the effectiveness of the proposed approach with other state-of-the-art approaches on traffic sign detection, this paper conducts experiments on two public datasets: German Traffic Sign Detection Benchmark and TsinghuaTencent 100K

Read more

Summary

Introduction

Vision-based traffic sign recognition plays an essential role in intelligent transport systems such as an automated driving system and an advanced driver assistance system. The accuracy of traffic sign detection has a dramatic effect on the accuracy of the whole system. Many approaches have been proposed to detect traffic sign [1]. Traditional methods [2,3,4,5,6,7] are usually based on hand-crafted features such as color, texture, edge, and other low-level features to detect traffic sign in an image. Due to the diversity of the traffic sign appearance, the occlusion of traffic sign by other objects, and the effect of lighting conditions, traditional methods for traffic sign detection showed poor performance

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call