Active Learning Based 3D Semantic Labeling From Images and Videos

Mengqi Rong,Hanqing Jiang,Zhanyi Hu,Hongmin Liu,Hainan Cui,Shuhan Shen

doi:10.1109/tcsvt.2021.3079991

Abstract

3D semantic segmentation is one of the most fundamental problems for 3D scene understanding and has attracted much attention in the field of computer vision. In this paper, we propose an active learning based 3D semantic labeling method for large-scale 3D mesh model generated from images or videos. Taking as input a 3D mesh model reconstructed from the image based 3D modeling system, coupled with the calibrated images, our method outputs a fine 3D semantic mesh model in which each facet is assigned a semantic label. There are three major steps in our framework: 2D semantic segmentation, 2D-3D semantic fusion, and batch image selection. A limited annotation image set is first used to fine-tune a pre-trained semantic segmentation network for obtaining the pixel-wise semantic probability maps. Then all these maps are back-projected into 3D space and fused on the 3D mesh model using Markov Random Field optimization, thus yield a preliminary 3D semantic mesh model and a heat model showing each facet’s confidence. This 3D semantic model is used as a reliable supervisor to select the parts that are not well segmented for manual annotation to boost the performance of the 2D semantic segmentation network, as well as the 3D mesh labeling, in the next iteration. This Training-Fusion-Selection process continues until the label assignment of the 3D mesh model becomes steady. By this means, we significantly reduce the amount for annotation but not the labeling quality of 3D semantic models. Extensive experiments demonstrate the effectiveness and generalization ability of our method on a wide variety of datasets.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Active Learning Based 3D Semantic Labeling From Images and Videos

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society

Lead the way for us

Journal: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society	Publication Date: May 13, 2021
Citations: 4

Similar Papers

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning

-

29 Dec 2020
29 Dec 2020

3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning
Mengqi Rong ... Zhanyi Hu
-
Mengqi Rong, et. al.Mengqi Rong ... Zhanyi Hu
10 Jan 2021
10 Jan 2021

Fine-Level Semantic Labeling of Large-Scale 3D Model by Active Learning
Yang Zhou ... Shuhan Shen
-
Yang Zhou, et. al.Yang Zhou ... Shuhan Shen
01 Sep 2018
01 Sep 2018

The spherical harmonic based resolution increase and decrease method for cell mesh model with the vertex and face numbers consistency
Chinyi Cheng ... Yusri Dwi Heryanto
-
Chinyi Cheng, et. al.Chinyi Cheng ... Yusri Dwi Heryanto
18 Nov 2020
18 Nov 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Active Learning Based 3D Semantic Labeling From Images and Videos

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society