Sem-Aug: Improving Camera-LiDAR Feature Fusion With Semantic Augmentation for 3D Vehicle Detection

Lin Zhao,Yufeng Yue,Meiling Wang

doi:10.1109/lra.2022.3191208

Abstract

Camera-LiDAR fusion provides precise distance measurements and fine-grained textures, making it a promising option for 3D vehicle detection in autonomous driving scenarios. Previous camera-LiDAR based 3D vehicle detection approaches mainly focused on employing image-based pre-trained models to fetch semantic features. However, these methods may perform inferior to the LiDAR-based ones when lacking semantic segmentation labels in autonomous driving tasks. Motivated by this observation, we propose a novel semantic augmentation method, namely Sem-Aug, to guide high-confidence camera-LiDAR fusion feature generation and boost the performance of multimodal 3D vehicle detection. The key novelty of semantic augmentation lies in the 2D segmentation mask auto-labeling, which provides supervision for semantic segmentation sub-network to mitigate the poor generalization performance of camera-LiDAR fusion. Using semantic-augmentation-guided camera-LiDAR fusion features, Sem-Aug achieves remarkable performance on the representative autonomous driving KITTI dataset compared to both the LiDAR-based baseline and previous multimodal 3D vehicle detectors. Qualitative and quantitative experiments demonstrate that Sem-Aug provides significant improvements in challenging <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Hard</i> detection scenarios caused by occlusion and truncation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Sem-Aug: Improving Camera-LiDAR Feature Fusion With Semantic Augmentation for 3D Vehicle Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters

Lead the way for us

Journal: IEEE Robotics and Automation Letters	Publication Date: Oct 1, 2022
Citations: 15

Similar Papers

Image guidance based 3D vehicle detection in traffic scene
Deyun Dai ... Hao Zhao
Neurocomputing | VOL. 428
Deyun Dai, et. al.Deyun Dai ... Hao Zhao
08 Dec 2020
Neurocomputing | VOL. 428

CenterLoc3D: monocular 3D vehicle localization network for roadside surveillance cameras
Xinyao Tang ... Chunhui Zhao
Complex & Intelligent Systems | VOL. 9
Xinyao Tang, et. al.Xinyao Tang ... Chunhui Zhao
03 Jan 2023
Complex & Intelligent Systems | VOL. 9

Joint Monocular 3D Vehicle Detection and Tracking
Hou-Ning Hu ... Trevor Darrell
-
Hou-Ning Hu, et. al.Hou-Ning Hu ... Trevor Darrell
01 Oct 2019
01 Oct 2019

3D Vehicle Information Recognition Algorithm of Monocular Camera Based onSelf-Calibration in Traffic Scene
Xinyao Tang ... Hua Cui
Journal of Computer-Aided Design & Computer Graphics | VOL. 32
Xinyao Tang, et. al.Xinyao Tang ... Hua Cui
01 Aug 2020
Journal of Computer-Aided Design & Computer Graphics | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Sem-Aug: Improving Camera-LiDAR Feature Fusion With Semantic Augmentation for 3D Vehicle Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Robotics and Automation Letters