View-aligned pixel-level feature aggregation for 3D shape classification

Yong Xu,Shaohui Pan,Ruotao Xu,Haibin Ling

doi:10.1016/j.cviu.2024.104098

Abstract

Multi-view 3D shape classification, which identifies a 3D shape based on its 2D views rendered from different viewpoints, has emerged as a promising method of shape understanding. A key building block in these methods is cross-view feature aggregation. However, existing methods dominantly follow the “extract-then-aggregate” pipeline for view-level global feature aggregation, leaving cross-view pixel-level feature interaction under-explored. To tackle this issue, we develop a “fuse-while-extract” pipeline, with a novel View-aligned Pixel-level Fusion (VPF) module to fuse cross-view pixel-level features originating from the same 3D part. We first reconstruct the 3D coordinate of each feature via the rasterization results, then match and fuse the features via spatial neighbor searching. Incorporating the proposed VPF module with ResNet18 backbone, we build a novel view-aligned multi-view network, which conducts feature extraction and cross-view fusion alternatively. Extensive experiments have demonstrated the effectiveness of the VPF module as well as the excellent performance of the proposed network.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

View-aligned pixel-level feature aggregation for 3D shape classification

Abstract

Talk to us

Similar Papers

More From: Computer Vision and Image Understanding

Lead the way for us

Similar Papers

A GCN and Transformer complementary network for skeleton-based action recognition
Xuezhi Xiang ... Abdulmotaleb El Saddik
Computer Vision and Image Understanding | VOL. 249
Xuezhi Xiang, et. al.Xuezhi Xiang ... Abdulmotaleb El Saddik
22 Oct 2024
Computer Vision and Image Understanding | VOL. 249

Invisible backdoor attack with attention and steganography
Wenmin Chen ... Yangming Chen
Computer Vision and Image Understanding | VOL. 249
Wenmin Chen, et. al.Wenmin Chen ... Yangming Chen
19 Oct 2024
Computer Vision and Image Understanding | VOL. 249

FTM: The Face Truth Machine—Hand-crafted features from micro-expressions to support lie detection
Maria De Marsico ... Donato Francesco Pio Stanco
Computer Vision and Image Understanding | VOL. 249
Maria De Marsico, et. al.Maria De Marsico ... Donato Francesco Pio Stanco
16 Oct 2024
Computer Vision and Image Understanding | VOL. 249

PMGNet: Disentanglement and entanglement benefit mutually for compositional zero-shot learning
Yu Liu ... Nicu Sebe
Computer Vision and Image Understanding | VOL. 249
Yu Liu, et. al.Yu Liu ... Nicu Sebe
16 Oct 2024
Computer Vision and Image Understanding | VOL. 249

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

View-aligned pixel-level feature aggregation for 3D shape classification

Abstract

Talk to us

Similar Papers

More From: Computer Vision and Image Understanding