Abstract

Real-time understanding of the surrounding environment is an essential yet challenging task for an autonomous driving system. The system must deliver not only accurate results but also low-latency performance. In this paper, we focus on the task of fast-and-accurate semantic segmentation. We propose an efficient and powerful deep neural network, termed Driving Segmentation Network (DSNet), and a novel loss function, Object Weighted Focal Loss (OWFL). In designing DSNet, our goal is to achieve the best capacity under constrained model complexity. We design an efficient and powerful unit inspired by ShuffleNet V2 and integrate many successful techniques to achieve an excellent balance between accuracy and speed. DSNet has 0.9 million parameters, achieves 71.8% mean Intersection-over-Union (IoU) on the Cityscapes validation set and 69.3% on the test set, and runs at 100+ frames per second (FPS) at a resolution of 640 × 360 on an NVIDIA 1080Ti. To improve performance on minor and hard objects, which are crucial in driving scenes, OWFL is proposed to address the severe class imbalance inherent in pixel-wise segmentation. It effectively improves the overall mean IoU of minor and hard objects by increasing their contribution to the loss. Experiments show that DSNet scores 2.7 percentage points higher on minor and hard objects than the fast-and-accurate model ERFNet at similar overall accuracy. These traits indicate that DSNet has great potential for practical autonomous driving applications.
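
The exact definition of OWFL is given in the paper's LOSS FUNCTION section. As a rough, non-authoritative illustration of the idea described above (a focal-style term whose per-pixel contribution is scaled up for rare and hard classes), a minimal PyTorch sketch could look as follows; the function name, the inverse-log-frequency weighting, and the single global focusing exponent are assumptions for illustration, not the paper's formulation. The `class_freq` argument plays the role of the per-class frequency γi mentioned in the Highlights.

```python
import torch
import torch.nn.functional as F

def object_weighted_focal_loss(logits, target, class_freq,
                               gamma=2.0, ignore_index=255):
    """Illustrative OWFL-style loss (not the paper's exact definition).

    logits:     (N, C, H, W) raw network outputs
    target:     (N, H, W)    ground-truth class indices
    class_freq: (C,)         pixel frequency of each class in the dataset
    """
    # Rarer classes get larger weights (inverse-log-frequency weighting,
    # assumed here in the style of ENet's class-weighted cross entropy).
    weights = 1.0 / torch.log(1.02 + class_freq)             # (C,)

    log_p = F.log_softmax(logits, dim=1)                     # (N, C, H, W)
    ce = F.nll_loss(log_p, target, reduction="none",
                    ignore_index=ignore_index)               # (N, H, W)
    p_t = torch.exp(-ce)                                     # prob. of the true class
    focal = (1.0 - p_t) ** gamma * ce                        # down-weight easy pixels

    # Scale each pixel's loss by the weight of its ground-truth class.
    valid = target != ignore_index
    w = weights[target.masked_fill(~valid, 0)] * valid.float()
    return (w * focal).sum() / valid.sum().clamp(min=1)
```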

Highlights

  • An autonomous vehicle must immediately, accurately, and comprehensively understand its complex surrounding environment, which poses a great challenge to the driving perception system

  • Because inference speed varies across software and hardware settings, two indirect metrics are usually reported for lightweight Convolutional Neural Network (CNN) models: the number of parameters and the number of floating-point operations (FLOPs); a minimal counting sketch for both is given after this list

  • To show the effectiveness of the proposed loss function, we conduct experiments with four loss configurations: class weighted cross entropy (WCE), WCE plus Semantic Encoding Loss (WCE+SEL), focal loss plus SEL (FL+SEL), and Object Weighted Focal Loss plus SEL (OWFL+SEL). The 19 trainable classes of the Cityscapes dataset are grouped into 3 categories according to the value of γi, which represents an object's frequency in the whole dataset
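
Both indirect metrics from the second highlight can be computed directly from the model definition. The sketch below counts the trainable parameters of a PyTorch module and estimates the FLOPs of a single convolution from its configuration and output resolution; the layer sizes are hypothetical, and some papers instead report multiply-accumulates (MACs), i.e. half of the value computed here.

```python
import torch
import torch.nn as nn

def count_parameters(model: nn.Module) -> int:
    """Total number of trainable parameters."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad)

def conv2d_flops(layer: nn.Conv2d, out_h: int, out_w: int) -> int:
    """Approximate FLOPs of one Conv2d at a given output resolution.
    Counts one multiply-add as two operations."""
    kh, kw = layer.kernel_size
    flops_per_position = 2 * (layer.in_channels // layer.groups) * kh * kw * layer.out_channels
    return flops_per_position * out_h * out_w

# Hypothetical example: a 3x3 conv with 32 -> 64 channels on a 320x180 feature map.
conv = nn.Conv2d(32, 64, kernel_size=3, padding=1)
print(count_parameters(conv))        # 32*3*3*64 + 64 = 18,496 parameters
print(conv2d_flops(conv, 180, 320))  # ~2.1e9 FLOPs
```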

Summary

INTRODUCTION

An autonomous vehicle must immediately, accurately, and comprehensively understand its complex surrounding environment, which poses a great challenge to the driving perception system. With fewer than 0.4M parameters, we cannot achieve a mean IoU higher than 62% on the Cityscapes dataset. So few parameters can lead to unsatisfying results on critical objects in the driving scene; for example, the bicycle class in ENet [7] scores 34.1%, which is too low to provide accurate information for safe autonomous driving. We aim to propose a fast-and-accurate model for practical use. It should achieve an excellent balance between accuracy and inference speed, and focus on improving the performance on hard and minor objects. We design an efficient and powerful unit and an asymmetric encoder-decoder architecture inspired by ShuffleNet V2 [15] and ENet [7], and propose a lightweight model, Driving Segmentation Network (DSNet).
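
For context on the building block, the basic stride-1 ShuffleNet V2 [15] unit splits the input channels in half, transforms one half with pointwise and depthwise convolutions, concatenates the two halves, and shuffles the channels so information mixes across branches. The sketch below reproduces only that generic unit as an assumption-free baseline; DSNet's actual unit and its asymmetric encoder-decoder architecture are specified in the paper.

```python
import torch
import torch.nn as nn

def channel_shuffle(x: torch.Tensor, groups: int = 2) -> torch.Tensor:
    """Interleave channels so information flows between the two branches."""
    n, c, h, w = x.size()
    x = x.view(n, groups, c // groups, h, w)
    x = x.transpose(1, 2).contiguous()
    return x.view(n, c, h, w)

class ShuffleV2Unit(nn.Module):
    """Basic (stride-1) ShuffleNet V2 unit: channel split, a 1x1 -> 3x3
    depthwise -> 1x1 branch, concatenation, then channel shuffle."""

    def __init__(self, channels: int):
        super().__init__()
        c = channels // 2
        self.branch = nn.Sequential(
            nn.Conv2d(c, c, 1, bias=False), nn.BatchNorm2d(c), nn.ReLU(inplace=True),
            nn.Conv2d(c, c, 3, padding=1, groups=c, bias=False), nn.BatchNorm2d(c),
            nn.Conv2d(c, c, 1, bias=False), nn.BatchNorm2d(c), nn.ReLU(inplace=True),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x1, x2 = x.chunk(2, dim=1)                  # channel split
        out = torch.cat((x1, self.branch(x2)), dim=1)
        return channel_shuffle(out, groups=2)
```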

RELATED WORK
LOSS FUNCTION
DATASET AND EVALUATION METRICS
ABLATION STUDY OF LOSS FUNCTION
Findings
CONCLUSION
