Abstract

3D object detection, whose goal is to recover the 3D spatial structure of objects, is a challenging problem in many visual perception systems, e.g., autonomous driving, augmented reality, and robot navigation. Most existing region proposal network (RPN) based 3D object detection methods generate anchors over the whole 3D search space without using semantic information, which leads to inappropriately sized anchors. To tackle this issue, we propose a 2D-guided precision anchor generation network (PAG-Net). Specifically, we use a mature 2D detector to obtain 2D bounding boxes and category labels of objects as prior information. The 2D bounding boxes are then projected into 3D frustum space to generate more precise, category-adaptive 3D anchors. Furthermore, existing feature combination schemes (early fusion, late fusion, and deep fusion) fuse only features from high-level convolutional layers and ignore the missing-data problem of point clouds. To fuse RGB image and point cloud features more effectively, we propose a multi-layer fusion model that combines features from multiple convolutional layers in a nonlinear, iterative manner and merges global and local features effectively. We encode the point cloud with a bird's eye view (BEV) representation to handle its irregularity. Experimental results show that our approach improves the baseline by a large margin and outperforms most state-of-the-art methods on the KITTI object detection benchmark.
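
The abstract does not spell out the exact BEV encoding, but a common scheme in image-plus-LiDAR detectors (e.g., MV3D/AVOD-style pipelines) is to discretize the point cloud into a grid of height-slice maps plus a density channel. The sketch below illustrates that scheme only; the ranges, resolution, and channel layout are illustrative assumptions, not the values used in PAG-Net.

```python
import numpy as np

def point_cloud_to_bev(points, x_range=(0.0, 70.0), y_range=(-40.0, 40.0),
                       z_range=(-2.5, 1.0), resolution=0.1, num_height_slices=5):
    """Discretize a LiDAR point cloud of shape (N, 3) into a BEV tensor of
    shape (H, W, num_height_slices + 1): one max-height map per height slice
    plus a log-normalized density channel.

    All ranges and the 0.1 m resolution are illustrative defaults, not the
    configuration reported in the paper.
    """
    # Keep only points inside the region of interest.
    mask = ((points[:, 0] >= x_range[0]) & (points[:, 0] < x_range[1]) &
            (points[:, 1] >= y_range[0]) & (points[:, 1] < y_range[1]) &
            (points[:, 2] >= z_range[0]) & (points[:, 2] < z_range[1]))
    pts = points[mask]

    height = int((x_range[1] - x_range[0]) / resolution)
    width = int((y_range[1] - y_range[0]) / resolution)
    bev = np.zeros((height, width, num_height_slices + 1), dtype=np.float32)

    # Map metric x/y coordinates to grid row/column indices.
    rows = ((pts[:, 0] - x_range[0]) / resolution).astype(np.int32)
    cols = ((pts[:, 1] - y_range[0]) / resolution).astype(np.int32)

    # Assign each point to a vertical slice.
    slice_h = (z_range[1] - z_range[0]) / num_height_slices
    slices = np.clip(((pts[:, 2] - z_range[0]) / slice_h).astype(np.int32),
                     0, num_height_slices - 1)

    # Height channels: maximum normalized point height per cell and slice.
    norm_z = (pts[:, 2] - z_range[0]) / (z_range[1] - z_range[0])
    np.maximum.at(bev[:, :, :num_height_slices], (rows, cols, slices), norm_z)

    # Density channel: log-normalized point count per cell.
    counts = np.zeros((height, width), dtype=np.float32)
    np.add.at(counts, (rows, cols), 1.0)
    bev[:, :, -1] = np.minimum(1.0, np.log(counts + 1.0) / np.log(64.0))
    return bev
```

Such a grid turns the irregular, unordered point cloud into a regular image-like tensor, so standard 2D convolutional backbones can process it alongside the RGB branch before fusion.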
