Multi-Task Foreground-Aware Network with Depth Completion for Enhanced RGB-D Fusion Object Detection Based on Transformer.

Jiasheng Pan, Tao Yue, Yankun Yin, Songyi Zhong, Yanhao Tang

doi:10.3390/s24072374

Abstract

Fusing data from multiple sensors, specifically LiDAR and camera, is a prevalent approach to target recognition in autonomous driving systems. Traditional object detection algorithms are limited by the sparse nature of LiDAR point clouds, resulting in poor fusion performance, especially for small and distant targets. In this paper, a multi-task parallel neural network based on the Transformer is constructed to perform depth completion and object detection simultaneously. The loss functions are redesigned to reduce environmental noise in depth completion, and a new fusion module is designed to enhance the network's perception of the foreground and background. The network leverages the correlation between RGB pixels for depth completion, completing the LiDAR point cloud and addressing the mismatch between sparse LiDAR features and dense pixel features. Subsequently, depth map features are extracted and fused with RGB features, exploiting the depth feature differences between foreground and background to enhance object detection performance, especially for challenging targets. Compared to the baseline network, improvements of 4.78%, 8.93%, and 15.54% are achieved on the difficult-level metrics for cars, pedestrians, and cyclists, respectively. Experimental results also show that the network runs at 38 fps, validating the efficiency and feasibility of the proposed method.
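The mismatch between sparse LiDAR features and dense pixel features mentioned in the abstract can be illustrated by projecting LiDAR points onto the image plane and counting how many pixels receive a depth value. The following is a minimal sketch, not the paper's implementation: it assumes a simple pinhole camera, points already expressed in the camera frame, and hypothetical names (`project_to_sparse_depth`, `fill_ratio`, and the intrinsics `fx`, `fy`, `cx`, `cy`) chosen only for illustration.

```python
def project_to_sparse_depth(points, fx, fy, cx, cy, width, height):
    """Project 3D points (camera frame, z forward) into a sparse depth map.

    Assumption: pinhole model with illustrative intrinsics fx, fy, cx, cy.
    Returns a height x width list of lists; 0.0 marks pixels with no return.
    """
    depth = [[0.0] * width for _ in range(height)]
    for x, y, z in points:
        if z <= 0.0:
            # Points behind the camera cannot be projected.
            continue
        u = int(round(fx * x / z + cx))  # pixel column
        v = int(round(fy * y / z + cy))  # pixel row
        if 0 <= u < width and 0 <= v < height:
            # When several points land on one pixel, keep the nearest.
            if depth[v][u] == 0.0 or z < depth[v][u]:
                depth[v][u] = z
    return depth


def fill_ratio(depth):
    """Fraction of pixels that carry a valid (non-zero) depth value."""
    total = sum(len(row) for row in depth)
    valid = sum(1 for row in depth for d in row if d > 0.0)
    return valid / total


# Toy data (hypothetical): five points, an 8 x 6 image.
toy_points = [
    (0.0, 0.0, 5.0),   # lands on the image centre
    (1.0, 0.0, 10.0),  # lands one pixel to the right
    (0.0, 0.0, 2.0),   # same pixel as the first point, but nearer
    (0.0, 0.0, -1.0),  # behind the camera, skipped
    (10.0, 0.0, 1.0),  # projects outside the image, skipped
]
sparse = project_to_sparse_depth(toy_points, 10.0, 10.0, 4.0, 3.0, 8, 6)
ratio = fill_ratio(sparse)
```

In this toy case only 2 of the 48 pixels receive a depth value, while every pixel has an RGB value. A depth completion stage such as the one the abstract describes aims to fill the remaining pixels before the depth and RGB features are fused.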

Journal: Sensors
Publication Date: Apr 8, 2024
License type: CC BY 4.0

Similar Papers

A review of object detection based on deep learning
Youzi Xiao ... Jiachen Yu
Multimedia Tools and Applications | VOL. 79
12 Jun 2020

An Advanced Approach to Object Detection and Tracking in Robotics and Autonomous Vehicles Using YOLOv8 and LiDAR Data Fusion
Yanyan Dai ... Kidong Lee
Electronics | VOL. 13
07 Jun 2024

Dual-view 3D object recognition and detection via Lidar point cloud and camera image
Jing Li ... Xu Liu
Robotics and Autonomous Systems | VOL. 150
03 Jan 2022

A Multi-Sensor 3D Detection Method for Small Objects
Yuekun Zhao ... Dan Wei
World Electric Vehicle Journal | VOL. 15
10 May 2024