PLC-Fusion: Perspective-Based Hierarchical and Deep LiDAR Camera Fusion for 3D Object Detection in Autonomous Vehicles

Husnain Mushtaq,Xiaoheng Deng,Fizza Azhar,Mubashir Ali,Hafiz Husnain Raza Sherazi

doi:10.3390/info15110739

Abstract

Accurate 3D object detection is essential for autonomous driving, yet traditional LiDAR models often struggle with sparse point clouds. We propose perspective-aware hierarchical vision transformer-based LiDAR-camera fusion (PLC-Fusion) for 3D object detection to address this. This efficient, multi-modal 3D object detection framework integrates LiDAR and camera data for improved performance. First, our method enhances LiDAR data by projecting them onto a 2D plane, enabling the extraction of object perspective features from a probability map via the Object Perspective Sampling (OPS) module. It incorporates a lightweight perspective detector, consisting of interconnected 2D and monocular 3D sub-networks, to extract image features and generate object perspective proposals by predicting and refining top-scored 3D candidates. Second, it leverages two independent transformers—CamViT for 2D image features and LidViT for 3D point cloud features. These ViT-based representations are fused via the Cross-Fusion module for hierarchical and deep representation learning, improving performance and computational efficiency. These mechanisms enhance the utilization of semantic features in a region of interest (ROI) to obtain more representative point features, leading to a more effective fusion of information from both LiDAR and camera sources. PLC-Fusion outperforms existing methods, achieving a mean average precision (mAP) of 83.52% and 90.37% for 3D and BEV detection, respectively. Moreover, PLC-Fusion maintains a competitive inference time of 0.18 s. Our model addresses computational bottlenecks by eliminating the need for dense BEV searches and global attention mechanisms while improving detection range and precision.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PLC-Fusion: Perspective-Based Hierarchical and Deep LiDAR Camera Fusion for 3D Object Detection in Autonomous Vehicles

Abstract

Talk to us

Similar Papers

More From: Information

Lead the way for us

Journal: Information	Publication Date: Nov 19, 2024
License type: CC BY 4.0

Similar Papers

Fractional Intuitionistic Fuzzy Support Vector Machine: Diabetes Tweet Classification
Hassan Badi ... Karim El Moutaouakil
Information | VOL. 15
Hassan Badi, et. al.Hassan Badi ... Karim El Moutaouakil
19 Nov 2024
Information | VOL. 15

SoK: The Impact of Educational Data Mining on Organisational Administration
Hamad Almaghrabi ... Idrees Alsolbi
Information | VOL. 15
Hamad Almaghrabi, et. al.Hamad Almaghrabi ... Idrees Alsolbi
19 Nov 2024
Information | VOL. 15

PLC-Fusion: Perspective-Based Hierarchical and Deep LiDAR Camera Fusion for 3D Object Detection in Autonomous Vehicles
Husnain Mushtaq ... Hafiz Husnain Raza Sherazi
Information | VOL. 15
Husnain Mushtaq, et. al.Husnain Mushtaq ... Hafiz Husnain Raza Sherazi
19 Nov 2024
Information | VOL. 15

Benchmarking for a New Railway Accident Classification Methodology and Its Database: A Case Study in Mexico, the United States, Canada, and the European Union
Tania Elizabeth Sandoval-Valencia ... Juan C Jáuregui-Correa
Information | VOL. 15
Tania Elizabeth Sandoval-Valencia, et. al.Tania Elizabeth Sandoval-Valencia ... Juan C Jáuregui-Correa
18 Nov 2024
Information | VOL. 15

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PLC-Fusion: Perspective-Based Hierarchical and Deep LiDAR Camera Fusion for 3D Object Detection in Autonomous Vehicles

Abstract

Talk to us

Similar Papers

More From: Information