Scene Completion Research Articles

Collecting 3D point cloud data of buildings is important for many applications such as urban mapping, renovation, preservation, and energy simulation. However, laser-scanned point clouds are often difficult to analyze, visualize, and interpret due to incompletely scanned building facades caused by numerous sources of defects such as noise, occlusions, and moving objects. Several point cloud scene completion algorithms have been proposed in the literature, but they have been mostly applied to individual objects or small-scale indoor environments and not on large-scale scans of building facades. This paper introduces a method of performing point cloud scene completion of building facades using orthographic projection and generative adversarial inpainting methods. The point cloud is first converted into the 2D structured representation of depth and color images using an orthographic projection approach. Then, a data-driven 2D inpainting approach is used to predict the complete version of the scene, given the incomplete scene in the image domain. The 2D inpainting process is fully automated and uses a customized generative-adversarial network based on Pix2Pix that is trainable end-to-end. The inpainted 2D image is finally converted back into a 3D point cloud using depth remapping. The proposed method is compared against several baseline methods, including geometric methods such as Poisson reconstruction and hole-filling, as well as learning-based methods such as the point completion network (PCN) and TopNet. Performance evaluation is carried out based on the task of reconstructing real-world building facades from partial laser-scanned point clouds. Experimental results using the performance metrics of voxel precision, voxel recall, position error, and color error showed that the proposed method has the best performance overall.

Abstract Scene understanding is a significant research topic in computer vision, especially for robots to understand their environment intelligently. Semantic scene segmentation can help robots to identify the objects that are present in their surroundings, while semantic scene completion can enhance the ability of the robot to infer the object shape, which is pivotal for several high-level tasks. With dense Conditional Random Field (CRF), one key issue is how to construct the long-range interactions between nodes with Gaussian pairwise potentials. Another issue is what effective and efficient inference algorithms can be adapted to resolve the optimization. In this paper, we focus on semantic scene segmentation and completion optimization technology simultaneously using dense CRF based on a single depth image only. Firstly, we convert the single depth image into different down-sampled Truncated Signed Distance Function (TSDF) or flipped TSDF voxel formats, and formulate the pairwise potentials terms with such a representation. Secondly, we use the output results of an end-to-end 3D convolutional neural network named SSCNet to obtain the unary potentials. Finally, we pursue the efficiency of different CRF inference algorithms (the mean-field inference, the negative semi-definite specific difference of convex relaxation, the proximal minimization of linear programming and its variants, etc.). The proposed dense CRF and inference algorithms are evaluated on three different datasets (SUNCG, NYU, and NYUCAD). Experimental results demonstrate that the voxel-level intersection over union (IoU) of predicted voxel’s semantic and completion can reach to state-of-the-art. Specifically, for voxel semantic segmentation, the highest IoU improvements are 2.6%, 1.3%, 3.1%, and for scene completion, the highest IoU improvements are 2.5%, 3.7%, 5.4%, respectively for SUNCG, NYU, and NYUCAD datasets.

Scene Completion Research Articles

Articles published on Scene Completion

MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments

FFNet: Frequency Fusion Network for Semantic Scene Completion

Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

3D Semantic Scene Completion: A Survey

Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for Challenging Tasks in 3D Mapping

Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition

Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion

Point Cloud Semantic Scene Completion from RGB-D Images

Towards 3D LiDAR-based semantic scene understanding of 3D point cloud sequences: The SemanticKITTI Dataset

Graph Neural Network for Generative Furniture Arrangement

Anisotropic Convolutional Neural Networks for RGB-D Based Semantic Scene Completion.

Semantic Extraction of Permanent Structures for the Reconstruction of Building Interiors from Point Clouds

Point Cloud Scene Completion of Obstructed Building Facades with Generative Adversarial Inpainting.

Deep Generative Modeling for Scene Synthesis via Hybrid Representations

Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion

An Interactive Perspective Scene Completion Framework Guided by Complanate Mesh

A decentralised approach to scene completion using distributed feature hashgram

Depth Based Semantic Scene Completion With Position Importance Aware Loss

Semantic scene completion with dense CRF from a single depth image

Deep 3D semantic scene extrapolation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Scene Completion Research Articles

Articles published on Scene Completion

MotionSC: Data Set and Network for Real-Time Semantic Mapping in Dynamic Environments

FFNet: Frequency Fusion Network for Semantic Scene Completion

Not All Voxels Are Equal: Semantic Scene Completion from the Point-Voxel Perspective

3D Semantic Scene Completion: A Survey

Paris-CARLA-3D: A Real and Synthetic Outdoor Point Cloud Dataset for Challenging Tasks in 3D Mapping

Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition

Sparse Single Sweep LiDAR Point Cloud Segmentation via Learning Contextual Shape Priors from Scene Completion

Point Cloud Semantic Scene Completion from RGB-D Images

Towards 3D LiDAR-based semantic scene understanding of 3D point cloud sequences: The SemanticKITTI Dataset

Graph Neural Network for Generative Furniture Arrangement

Anisotropic Convolutional Neural Networks for RGB-D Based Semantic Scene Completion.

Semantic Extraction of Permanent Structures for the Reconstruction of Building Interiors from Point Clouds

Point Cloud Scene Completion of Obstructed Building Facades with Generative Adversarial Inpainting.

Deep Generative Modeling for Scene Synthesis via Hybrid Representations

Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion

An Interactive Perspective Scene Completion Framework Guided by Complanate Mesh

A decentralised approach to scene completion using distributed feature hashgram

Depth Based Semantic Scene Completion With Position Importance Aware Loss

Semantic scene completion with dense CRF from a single depth image

Deep 3D semantic scene extrapolation