Abstract

It is a challenging task for self-driving vehicles in Real-World traffic scenarios to find a trade-off between the real-time performance and the high accuracy of the detection, recognition, and tracking in videos. This issue is addressed in this paper with an improved YOLOv3 (You Only Look Once) and a multi-object tracking algorithm (Deep-Sort). First, data augmentation is employed for small sample traffic signs to address the problem of an extremely unbalanced distribution of different samples in the dataset. Second, a new architecture of YOLOv3 is proposed to make it more suitable for detecting small targets. The detailed method is (1) removing the output feature map corresponding to the 32-times subsampling of the input image in the original YOLOv3 structure to reduce its computational costs and improve its real-time performances; (2) adding an output feature map of 4-times subsampling to improve its detection capability for the small traffic signs; (3) Deep-Sort is integrated into the detection method to improve the precision and robustness of multi-object detection, and the tracking ability in videos. Finally, our method demonstrated better detection capabilities, with respect to state-of-the-art approaches, which precision, recall and mAP is 91%, 90%, and 84.76% respectively.

Highlights

  • IntroductionTraffic sign detection can be broadly divided into two categories [1,2,3,4,5,6]

  • Traffic sign detection and tracking is a critical task of self-driving vehicles in RealWorld traffic scenarios, which provides real-time decision support for the autopilot system.Traffic sign detection can be broadly divided into two categories [1,2,3,4,5,6]

  • It is still a challenge to compromise between computational cost and accuracy in It is still a challenge to compromise between computational cost and accuracy in small targets detecting for self-driving vehicles in actual scenarios

Read more

Summary

Introduction

Traffic sign detection can be broadly divided into two categories [1,2,3,4,5,6]. One is the traditional method based on manual features [1,2,3,4,5], and the other is the deep learning algorithm based on CNN (Convolutional Neural Network) [6]. In [2,3], RGB (Red, Green, Blue) and HSI (Hue, Saturation, Intensity) color model methods are used for detection owing to different color information (red, yellow, blue) for multifarious traffic signs. Deep learning-based object detection algorithms are more accurate and capable of evolving to more complex environments [6]

Objectives
Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.