PIDFusion: Fusing Dense LiDAR Points and Camera Images at Pixel-Instance Level for 3D Object Detection

Zheng Zhang,Ruyu Xu,Qing Tian

doi:10.3390/math11204277

Abstract

In driverless systems (scenarios such as subways, buses, trucks, etc.), multi-modal data fusion, such as light detection and ranging (LiDAR) points and camera images, is essential for accurate 3D object detection. In the fusion process, the information interaction between the modes is challenging due to the different coordinate systems of various sensors and the significant difference in the density of the collected data. It is necessary to fully consider the consistency and complementarity of multi-modal information, make up for the gap between multi-source data density, and achieve the joint interactive processing of multi-source information. Therefore, this paper is based on Transformer to improve a new multi-modal fusion model called PIDFusion for 3D object detection. Firstly, the method uses the results of 2D instance segmentation to generate dense 3D virtual points to enhance the original sparse 3D point clouds. This optimizes the issue that the nearest Euclidean distance in the 2D image space cannot ensure the nearest in the 3D space. Secondly, a new cross-modal fusion architecture is designed to maintain individual per-modality features to take advantage of their unique characteristics during 3D object detection. Finally, an instance-level fusion module is proposed to enhance semantic consistency through cross-modal feature interaction. Experiments show that PIDFusion is far ahead of existing 3D object detection methods, especially for small and long-range objects, with 70.8 mAP and 73.5 NDS on the nuScenes test set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mathematics	Publication Date: Oct 13, 2023
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

PIDFusion: Fusing Dense LiDAR Points and Camera Images at Pixel-Instance Level for 3D Object Detection

Abstract

Talk to us

Similar Papers

More From: Mathematics

Lead the way for us

Similar Papers

Leveraging Self-Paced Semi-Supervised Learning with Prior Knowledge for 3D Object Detection on a LiDAR-Camera System
Pei An ... Siwen Quan
Remote Sensing | VOL. 15
Pei An, et. al.Pei An ... Siwen Quan
20 Jan 2023
Remote Sensing | VOL. 15

3D object detection for autonomous driving from image: a survey——benchmarks, constraints and error analysis
Xiying Li ... Jianwu Dang
Journal of Image and Graphics | VOL. 28
Xiying Li, et. al.Xiying Li ... Jianwu Dang
01 Jan 2023
Journal of Image and Graphics | VOL. 28

PIFNet: 3D Object Detection Using Joint Image and Point Cloud Features for Autonomous Driving
Wenqi Zheng ... Hyunchul Shin
Applied Sciences | VOL. 12
Wenqi Zheng, et. al.Wenqi Zheng ... Hyunchul Shin
06 Apr 2022
Applied Sciences | VOL. 12

Geometric information constraint 3D object detection from LiDAR point cloud for autonomous vehicles under adverse weather
Yuanfan Qi ... Dazhi Wang
Transportation Research Part C: Emerging Technologies | VOL. 161
Yuanfan Qi, et. al.Yuanfan Qi ... Dazhi Wang
16 Mar 2024
Transportation Research Part C: Emerging Technologies | VOL. 161

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PIDFusion: Fusing Dense LiDAR Points and Camera Images at Pixel-Instance Level for 3D Object Detection

Abstract

Talk to us

Similar Papers

More From: Mathematics