Abstract

Convolutional Neural Networks (CNN) achieves perfection in traffic sign identification with enough annotated training data. The dataset determines the quality of the complete visual system based on CNN. Unfortunately, databases for traffic signs from the majority of the world's nations are few. In this scenario, Generative Adversarial Networks (GAN) may be employed to produce more realistic and varied training pictures to supplement the actual arrangement of images. The purpose of this research is to describe how the quality of synthetic pictures created by DCGAN, LSGAN, and WGAN is determined. Our work combines synthetic images with original images to enhance datasets and verify the effectiveness of synthetic datasets. We use different numbers and sizes of images for training. Likewise, the Structural Similarity Index (SSIM) and Mean Square Error (MSE) were employed to assess picture quality. Our study quantifies the SSIM difference between the synthetic and actual images. When additional images are used for training, the synthetic image exhibits a high degree of resemblance to the genuine image. The highest SSIM value was achieved when using 200 total images as input and 32×32 image size. Further, we augment the original picture dataset with synthetic pictures and compare the original image model to the synthesis image model. For this experiment, we are using the latest iterations of Yolo, Yolo V3, and Yolo V4. After mixing the real image with the synthesized image produced by LSGAN, the recognition performance has been improved, achieving an accuracy of 84.9% on Yolo V3 and an accuracy of 89.33% on Yolo V4.

Highlights

  • Traffic sign identification has emerged as a critical study area in the science of computer vision in recent years

  • The third groups combined the original image with synthetic image produced by Least Squares Generative Adversarial Networks (LSGAN)

  • The synthetic picture created using different Generative Adversarial Networks (GAN) techniques will be utilized for training and combined with the actual picture to improve the performance of the traffic sign recognition system

Read more

Summary

Introduction

Traffic sign identification has emerged as a critical study area in the science of computer vision in recent years. Many researchers have done extensive research and discussion on the identification of traffic signs. They provide many data sets for public use, such as the German Traffic Signals Dataset (GTSRB) [5], [6], the Chinese Traffic Sign Database (TSRD) and Tsinghua Tencent 100K (TT100K) [7]. Obtaining a huge quantity of highquality images of traffic signs is not straightforward [8], [9]. It takes a considerable amount of time, whether it is using a dashcam or on-site filming.

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call