Intelligent Vehicle Environment Scene Parsing Method Based on Multi-tasking Convolutional Neural Network

Jing Lian,Jiahao Pi,Yuhang Yin,Yuekai Yang

doi:10.1109/cvci51460.2020.9338621

Abstract

An encoder-decoder convolutional neural network architecture is presented integrating multi-class semantic segmentation and multi-object detection to improve the efficiency and depth of scene parsing of intelligent vehicle. The encoder of the network is designed as a multi-scale structure to improve real-time performance while ensuring the accuracy. The decoders of the network comprise the semantic segmentation and object detection subnetworks, which share encoder feature maps to improve computational efficiency. During the training process, we use FPS (Frames Per Second) and MIoU (Mean Intersection over Union) as the evaluation metrics of semantic segmentation, while the mAP (mean Average Precision) and FPS are used as the performance evaluation indexes of object detection. We conduct separate and joint training on the network to evaluate its performance. Experimental results show that the proposed network can realize multi-class semantic segmentation and multi-object detection simultaneously with better real-time performance and richer feature information, making it highly possible for implementation on real vehicles.

Full Text