Abstract

3D semantic maps play an increasingly important role in a wide variety of applications, especially for task-driven robots. In this paper, we present a semantic mapping methodology that obtains 3D semantic maps from RGB-D scans. In contrast to existing methods that use 3D annotations as supervision, we focus on accurate 2D frame labeling and combine the labels in 3D space through a semantic fusion mechanism. For scene parsing, a two-stream network with a novel discriminatory mask loss is proposed to fully extract and fuse RGB and depth information, achieving stable semantic segmentation. The discriminatory mask guides the cross-entropy loss function by modulating the influence of individual pixels on back-propagation, which reduces the harmful effects of depth noise and fallible annotations at object edges. Once the correspondences between frames are available, the semantic frames are fused in unified 3D coordinates using a novel label-oriented voxelgrid filter. By introducing a label-oriented statistical principle into labeled point clouds, it ensures intra-frame spatial continuity and inter-frame spatiotemporal consistency. To avoid unfavorable interference between uncorrelated frames, we further propose an adaptive grouping algorithm that applies a view frustum filter to group frames with sufficient overlap into a segment. Finally, we demonstrate the effectiveness of the proposed method on the 2D/3D semantic label benchmarks of the ScanNetv2 and Cityscapes datasets.
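
As a minimal sketch of two of the components described above, the snippet below shows (i) a per-pixel cross-entropy loss weighted by a discriminatory mask and (ii) a label-oriented voxelgrid filter that keeps the majority (statistical) label per occupied voxel. The function names, the form of the mask, and the voxel size are illustrative assumptions, not the paper's exact formulation.

import numpy as np
import torch
import torch.nn.functional as F

def masked_cross_entropy(logits, target, mask):
    # logits: (B, C, H, W) class scores; target: (B, H, W) integer labels
    # mask:   (B, H, W) per-pixel weights in [0, 1]; small values down-weight
    #         unreliable pixels, e.g. object edges affected by depth noise or
    #         fallible annotations (assumed form of the discriminatory mask)
    per_pixel = F.cross_entropy(logits, target, reduction="none")
    return (per_pixel * mask).sum() / mask.sum().clamp(min=1e-6)

def label_oriented_voxel_filter(points, labels, voxel_size=0.05):
    # points: (N, 3) coordinates in the unified world frame
    # labels: (N,)   per-point semantic labels back-projected from 2D frames
    # Returns one centroid and the majority label per occupied voxel.
    keys = np.floor(points / voxel_size).astype(np.int64)
    _, inverse, counts = np.unique(keys, axis=0,
                                   return_inverse=True, return_counts=True)
    order = np.argsort(inverse)
    groups = np.split(order, np.cumsum(counts)[:-1])
    centroids, voted = [], []
    for idx in groups:
        centroids.append(points[idx].mean(axis=0))
        vals, freqs = np.unique(labels[idx], return_counts=True)
        voted.append(vals[np.argmax(freqs)])  # label-oriented statistic: majority vote
    return np.stack(centroids), np.array(voted)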
