Abstract

One of the most important key points in the intelligent transportation systems is scene understanding of the known and unknown surrounding environment to achieve a safe driving for smart mobile robots and cars. Semantic segmentation can address most of the perception needs of mobile robots and Intelligent Vehicles (IV). There are several deep learning approaches based on Convolutional Neural Network (CNN) for semantic segmentation. Most of these techniques have been designed on a pretrained network base and loading a specific weight file is necessary for them. In this paper, we propose a deep architecture for semantic segmentation from scratch based on an asymmetry encoder- decoder architecture using Ghost-Net and U-Net which we have called it Ghost-UNet. This model can be used for precise segmentation using a combination of low-level spatial information and high-level feature maps. We focus our work on outdoor datasets to evaluate the proposed model which is tested on the Cityscapes dataset. The proposed model has good pixel accuracy and mean Intersection over Union (mIoU) compared with other valid literature.

Highlights

  • Intelligent transportation systems are an important part of smart cities for transmuting cities into digital societies

  • Ghost-Unet model is proposed as an asymmetric encoder- decoder architecture for high accuracy semantic segmentation and it has been designed considering a reasonable number of convolutional layers

  • Some of the classes have low mean Intersection over Union (mIoU) because of overlap between classes in the original images, hazy objects, different resolution, and brightness etc which cause to a low accuracy in the training step

Read more

Summary

INTRODUCTION

Intelligent transportation systems are an important part of smart cities for transmuting cities into digital societies. Kazerouni et al.: Ghost-UNet: Asymmetric Encoder-Decoder Architecture segmentation requires fusing dense pixel-level accuracy with multiscale contextual interpretation [4]. It helps self-driving cars and autonomous mobile robots to find paths without collision and detect the main objects of the known and unknown environment. Encoder-decoder structure is a well-known method for semantic segmentation to process the low- and high-level features and build final classified images. Ghost-Unet model is proposed as an asymmetric encoder- decoder architecture for high accuracy semantic segmentation and it has been designed considering a reasonable number of convolutional layers.

RELATED WORK
DISCUSSION AND RESULTS
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call