Abstract
Existing multi-view fusion methods fuse point- or proposal-level features from different views only at the final stage of the backbone. This fuse-once-at-the-end design prevents timely correction of spatial misalignment between features from different views, so the discriminative depth and orientation details of 3D oriented point-cloud objects may be filtered out. To enhance the feature-capture capability of the network, we introduce a cascaded multi-3D-view fusion method (CM3DV) that learns an implicit representation of object orientation. Specifically, CM3DV incorporates the cylindrical front-view projection into a voxelised 3D bird's-eye-view representation in a cascaded manner, and vice versa. By learning a 3D-regulated instance representation, this bi-directional mutual fusion module, termed the cascaded multi-view feature fusion module, alleviates the spatial misalignment between the two views. Furthermore, to learn rotation- and shape-invariant object features, a modulated rotation head (MRH) applies a direction-guided adjustment instead of an axis-aligned structure to extract instance-consistent features. By excluding irrelevant content, MRH yields instance-consistent features that benefit object classification and orientation regression. Extensive experiments on the KITTI dataset show that the proposed method achieves a significant improvement over existing state-of-the-art methods, especially for orientation estimation.
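To make the bi-directional fusion idea concrete, the following is a minimal sketch (not the authors' code) of one cascade stage: each view receives a projected, gated copy of the other view's features, so misalignment can be corrected between backbone stages rather than only at the end. The class name `CascadedFusionStage` and the index maps `fv2bev_idx`/`bev2fv_idx` (assumed precomputed from the point-cloud geometry) are illustrative assumptions.

```python
import torch
import torch.nn as nn


class CascadedFusionStage(nn.Module):
    """Hypothetical single stage of bi-directional front-view/BEV fusion.

    Each view's features are warped into the other view via precomputed
    index maps, adapted by 1x1 convs, gated, and added residually.
    """

    def __init__(self, channels: int):
        super().__init__()
        # 1x1 convs adapt the projected cross-view features before fusion.
        self.fv_adapt = nn.Conv2d(channels, channels, kernel_size=1)
        self.bev_adapt = nn.Conv2d(channels, channels, kernel_size=1)
        # Sigmoid gates modulate how much cross-view evidence is mixed in.
        self.fv_gate = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())
        self.bev_gate = nn.Sequential(nn.Conv2d(channels, channels, 1), nn.Sigmoid())

    def forward(self, fv_feat, bev_feat, fv2bev_idx, bev2fv_idx):
        # fv_feat:  (B, C, Hf, Wf) cylindrical front-view features
        # bev_feat: (B, C, Hb, Wb) bird's-eye-view features
        # bev2fv_idx: (B, Hf, Wf) long tensor; for each FV pixel, the flat
        #   index of its BEV cell (values in [0, Hb*Wb))
        # fv2bev_idx: (B, Hb, Wb) long tensor; the reverse mapping
        B, C, Hf, Wf = fv_feat.shape
        _, _, Hb, Wb = bev_feat.shape

        # Gather BEV features at the cell each front-view pixel projects to.
        bev_flat = bev_feat.flatten(2)                        # (B, C, Hb*Wb)
        idx = bev2fv_idx.view(B, 1, -1).expand(-1, C, -1)     # (B, C, Hf*Wf)
        bev_in_fv = torch.gather(bev_flat, 2, idx).view(B, C, Hf, Wf)

        # And vice versa: front-view features gathered into the BEV grid.
        fv_flat = fv_feat.flatten(2)                          # (B, C, Hf*Wf)
        idx = fv2bev_idx.view(B, 1, -1).expand(-1, C, -1)     # (B, C, Hb*Wb)
        fv_in_bev = torch.gather(fv_flat, 2, idx).view(B, C, Hb, Wb)

        # Gated residual fusion in both directions.
        fv_out = fv_feat + self.fv_gate(fv_feat) * self.fv_adapt(bev_in_fv)
        bev_out = bev_feat + self.bev_gate(bev_feat) * self.bev_adapt(fv_in_bev)
        return fv_out, bev_out
```

Interleaving several such stages with backbone blocks would give the cascaded, mutual fusion the abstract describes, as opposed to a single fusion step at the end of the backbone.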