Abstract
Despite the significant growth in the availability of 3D light detection and ranging (LiDAR) point cloud data in recent years, annotation remains expensive and time-consuming. This has led to an increasing demand for weakly-supervised semantic segmentation (WSSS) methods in applications such as autonomous driving, mapping, and robotics. Existing approaches typically rely solely on LiDAR point cloud data for WSSS, which often results in lower segmentation accuracy due to the sparsity of point clouds. To address these challenges, we propose a novel architecture, PPDistiller, which employs multiple teacher networks from different modalities. Compared to other WSSS and multimodal approaches, PPDistiller achieves superior segmentation accuracy with fewer annotations. This is achieved through the novel Mean Multi-Teacher (MMT) framework, which incorporates multiple modalities and teachers. To address the lack of 2D labels, we propose the Distance-CAM Self-Training (DCAM-ST) module, which utilizes sparse weak 3D annotations to produce accurate 2D pixel-level annotations. To enable adaptive fusion of 2D and 3D data, we introduce the Attention Point-to-Pixel Fusion (APPF) module, which facilitates bidirectional transfer of cross-modal knowledge. Additionally, to fully leverage the spatial semantic information in point clouds, we propose the Pyramid Semantic-context Neighbor Aggregation (PSNA) module, which exploits spatial and semantic correlations to improve performance. Extensive experiments on the SemanticKITTI, ScribbleKITTI, and nuScenes datasets demonstrate the superiority of the proposed method: compared to state-of-the-art fusion and weakly-supervised methods, PPDistiller achieves the highest mean Intersection over Union (mIoU) scores under both fully-supervised and weakly-supervised settings.
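The name "Mean Multi-Teacher" suggests a mean-teacher-style scheme in which each modality-specific teacher is maintained as an exponential moving average (EMA) of its student, extended here to one teacher per modality. The sketch below illustrates that update rule under this assumption only; the module names, stand-in backbones, and the smoothing coefficient alpha are illustrative and not taken from the paper.

```python
import copy

import torch
import torch.nn as nn


@torch.no_grad()
def ema_update(teacher: nn.Module, student: nn.Module, alpha: float = 0.999) -> None:
    """Standard mean-teacher EMA update: theta_t <- alpha * theta_t + (1 - alpha) * theta_s."""
    for t_param, s_param in zip(teacher.parameters(), student.parameters()):
        t_param.mul_(alpha).add_(s_param, alpha=1.0 - alpha)


# Hypothetical multi-teacher setup: one student/teacher pair per modality
# (2D image branch and 3D point cloud branch). nn.Linear modules stand in
# for the real segmentation backbones.
student_2d, student_3d = nn.Linear(16, 4), nn.Linear(16, 4)
teacher_2d, teacher_3d = copy.deepcopy(student_2d), copy.deepcopy(student_3d)

# After each student optimization step, every teacher tracks its own student.
for teacher, student in [(teacher_2d, student_2d), (teacher_3d, student_3d)]:
    ema_update(teacher, student, alpha=0.999)
```

Keeping a slowly-moving EMA teacher per modality is a common way to obtain stable targets for distillation; how PPDistiller combines the teachers' outputs is described in the paper itself.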