AIFormer: Adaptive Interaction Transformer for 3D Point Cloud Understanding

Xutao Chu,Shengjie Zhao,Hongwei Dai

doi:10.3390/rs16214103

Abstract

Recently, significant advancements have been made in 3D point cloud analysis by leveraging transformer architecture in 3D space. However, it remains challenging to effectively implement local and global learning within irregular and sparse structures of 3D point clouds. This paper presents the Adaptive Interaction Transformer (AIFormer), a novel hierarchical transformer architecture designed to enhance 3D point cloud analysis by fusing local and global features through the adaptive interaction of features. Specifically, AIFormer mainly consists of several stacked AIFormer Blocks. Each AIFormer module employs the Local Relation Aggregation Module and the Global Context Aggregation Module, respectively, to extract local details of relationships within the reference point and long-range dependencies between reference points. Then, the local and global features are fused using the Adaptive Interaction Module for adaptive interaction to optimize the point representation. Additionally, the AIFormer Block further designs geometric relation functions and contextual relative semantic encoding to enhance local and global feature extraction capabilities, respectively. Extensive experiments on three popular 3D point cloud datasets verify that AIFormer achieves state-of-the-art or comparable performances. Our comprehensive ablation study further validates the effectiveness and soundness of the AIFormer design.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AIFormer: Adaptive Interaction Transformer for 3D Point Cloud Understanding

Abstract

Talk to us

Similar Papers

More From: Remote Sensing

Lead the way for us

Journal: Remote Sensing	Publication Date: Nov 2, 2024
License type: CC BY 4.0

Similar Papers

FWNet: Semantic Segmentation for Full-Waveform LiDAR Data Using Deep Learning.
Takayuki Shinohara ... Masashi Matsuoka
Sensors | VOL. 20
Takayuki Shinohara, et. al.Takayuki Shinohara ... Masashi Matsuoka
24 Jun 2020
Sensors | VOL. 20

LLGF-Net: Learning Local and Global Feature Fusion for 3D Point Cloud Semantic Segmentation
Jiazhe Zhang ... Zheng Zhang
Electronics | VOL. 11
Jiazhe Zhang, et. al.Jiazhe Zhang ... Zheng Zhang
13 Jul 2022
Electronics | VOL. 11

3D point cloud semantic segmentation toward large-scale unstructured agricultural scene classification
Yi Chen ... Qian Zhang
Computers and Electronics in Agriculture | VOL. 190
Yi Chen, et. al.Yi Chen ... Qian Zhang
13 Sep 2021
Computers and Electronics in Agriculture | VOL. 190

Concepts and techniques for processing and rendering of massive 3D point clouds

-

01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AIFormer: Adaptive Interaction Transformer for 3D Point Cloud Understanding

Abstract

Talk to us

Similar Papers

More From: Remote Sensing