PSVMLP: Point and Shifted Voxel MLP for 3D deep learning

Guanghu Xie,Yang Liu,Yiming Ji,Zongwu Xie,Baoshi Cao

doi:10.1016/j.patrec.2024.05.016

Abstract

We propose a high-performance 3D feature extraction deep learning network based on point cloud and shifted voxel, named Point and Shifted Voxel MLP (PSVMLP). The main component of PSVMLP is simple Multi-Layer Perceptron (MLP) structure. PSVMLP achieves effective extraction of multi-scale features from 3D data. Specifically, we combine point cloud and voxel-based feature extraction methods. In voxel representation learning, we propose a wide-range geometric feature extraction method based on axial shifting operations and simple MLP structure. The axial shifting operations allow shifting voxels in the depth, height, and width directions, capturing more geometric information. In point cloud representation learning, we use simple MLP structure to extract local features, and we also extract global features by combining transformer structure. By combining point cloud and voxel feature extraction methods, we obtain rich feature representations from different scales, enhancing the model’s expressive power and generalization performance. Applying our designed model to basic geometric feature learning tasks, we achieve excellent results. Despite being built primarily on a simple MLP framework, our model demonstrates remarkable performance on both shape classification and shape part segmentation tasks. Our code is available at https://github.com/hitxraz/psvmlp.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PSVMLP: Point and Shifted Voxel MLP for 3D deep learning

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Similar Papers

Point cloud semantic segmentation based on local feature fusion and multilayer attention network
Junjie Wen ... Tong Nie
IET Computer Vision | VOL. 18
Junjie Wen, et. al.Junjie Wen ... Tong Nie
27 Nov 2023
IET Computer Vision | VOL. 18

Integrating Normal Vector Features into an Atrous Convolution Residual Network for LiDAR Point Cloud Classification
Chunjiao Zhang ... Yu Ma
Remote Sensing | VOL. 13
Chunjiao Zhang, et. al.Chunjiao Zhang ... Yu Ma
29 Aug 2021
Remote Sensing | VOL. 13

TransPose: 6D object pose estimation with geometry-aware Transformer
Xiao Lin ... Qijun Chen
Neurocomputing | VOL. 589
Xiao Lin, et. al.Xiao Lin ... Qijun Chen
05 Apr 2024
Neurocomputing | VOL. 589

Adaptive learning point cloud and image diversity feature fusion network for 3D object detection
Weiqing Yan ... Jindong Xu
Complex & Intelligent Systems | VOL. 10
Weiqing Yan, et. al.Weiqing Yan ... Jindong Xu
15 Dec 2023
Complex & Intelligent Systems | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PSVMLP: Point and Shifted Voxel MLP for 3D deep learning

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters