Multi-Task Visual Perception for Object Detection and Semantic Segmentation in Intelligent Driving

Jiao Zhan, Jingnan Liu, Chi Guo, Yejun Wu

doi:10.3390/rs16101774

Abstract

With the rapid development of intelligent driving vehicles, multi-task visual perception based on deep learning has emerged as a key technological pathway toward safe vehicle navigation in real traffic scenarios. However, because intelligent driving vehicles demand both high precision and high efficiency in practical driving environments, multi-task visual perception remains challenging. Existing methods typically adopt multi-task learning networks that handle multiple tasks concurrently. Although these methods achieve remarkable results, performance can be improved further by addressing open problems such as underutilized high-resolution features and underexploited non-local contextual dependencies. In this work, we propose YOLOPv3, an efficient anchor-based multi-task visual perception network that handles traffic object detection, drivable area segmentation, and lane detection simultaneously. Compared with prior work, we make essential improvements. First, we propose architecture enhancements that exploit multi-scale high-resolution features and non-local contextual dependencies to improve network performance. Second, we propose optimization improvements that enhance network training, enabling YOLOPv3 to achieve optimal performance via straightforward end-to-end training. Experimental results on the BDD100K dataset demonstrate that YOLOPv3 sets a new state of the art (SOTA): 96.9% recall and 84.3% mAP50 in traffic object detection, 93.2% mIoU in drivable area segmentation, and 88.3% accuracy and 28.0% IoU in lane detection. In addition, YOLOPv3 maintains an inference speed competitive with that of the lightweight YOLOP. Thus, YOLOPv3 is a robust solution for multi-task visual perception problems. The code and trained models have been released on GitHub.
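The segmentation results above are reported as IoU and mIoU. As a rough illustration of what those metrics measure, the following sketch computes per-class intersection over union and its mean on two tiny label masks. It is not taken from the YOLOPv3 code: the function names `class_iou` and `mean_iou`, the toy masks, and the choice to average over a background and a foreground class are all assumptions made for this example, and the abstract does not state the exact averaging convention used in the paper.

```python
from typing import Sequence

# A mask is a 2D grid of integer class labels (assumed: 0 = background, 1 = foreground).
Mask = Sequence[Sequence[int]]


def class_iou(pred: Mask, target: Mask, cls: int) -> float:
    """Intersection over union of the pixels labelled `cls` in both masks."""
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            in_pred = p == cls
            in_target = t == cls
            intersection += int(in_pred and in_target)
            union += int(in_pred or in_target)
    # Assumed convention: a class absent from both masks counts as a perfect match.
    return intersection / union if union else 1.0


def mean_iou(pred: Mask, target: Mask, classes: Sequence[int]) -> float:
    """Unweighted mean of the per-class IoU values."""
    return sum(class_iou(pred, target, c) for c in classes) / len(classes)


# Hypothetical 2x3 prediction and ground-truth masks.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]

foreground_iou = class_iou(pred, target, 1)  # 2 shared pixels out of 4 in the union
miou = mean_iou(pred, target, [0, 1])
print(f"foreground IoU = {foreground_iou:.2f}, mIoU = {miou:.2f}")
```

A single-class IoU, like the one reported for lane detection, corresponds to `class_iou` for the foreground label alone, while an mIoU averages across classes. That difference may partly explain why the two figures differ so widely in scale, since thin structures such as lane lines are penalized heavily for small misalignments.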

Journal: Remote Sensing
Publication Date: May 16, 2024
Citations: 1
License type: CC BY 4.0
