Dense Depth Image Research Articles

Depth completion is the task of reconstructing dense depth images from sparse LiDAR data. LiDAR depth completion, for which LiDAR data is the only input, is an ill-posed and challenging problem owing to the underlying properties of LiDAR data: extremely few points, presence of discontinuities, and absence of texture information. Accordingly, most approaches are heavily dependent on guided color images, which leads to unsatisfactory results when the color images are degraded. To alleviate the dependency on color images but leverage this information during training, we present a deep convolutional neural network (CNN) consisting of depth and edge CNNs via transferring of knowledge. In order to compensate for the limitations of LiDAR data, we design the edge CNN to learn a gradient depth image from a powerful teacher network through the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Knowledge-Distillation</i> method. Since the teacher network is trained with color images, color-embedded information can be obtained in the test phase even if color images are not used as an input. We further propose a <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Self-Distillation</i> method for transferring the color-embedded features from the edge CNN to the depth CNN. Enforcing the depth features to contain edge information hardly observed in LiDAR data enables the depth CNN to generate more edge-attentive and structure-preserving results. Our novel methods show remarkable results in outdoor and indoor environments for KITTI and NYU-Depth-V2 datasets. Experiments performed with low-channel LiDAR data in KITTI and few depth points in the NYU-Depth-V2 dataset show that our method is robust to data sparsity and applicable in various scenarios.

Read full abstract

To provide a realistic environment for remote sensing applications, point clouds are used to realize a three-dimensional (3D) digital world for the user. Motion recognition of objects, e.g., humans, is required to provide realistic experiences in the 3D digital world. To recognize a user’s motions, 3D landmarks are provided by analyzing a 3D point cloud collected through a light detection and ranging (LiDAR) system or a red green blue (RGB) image collected visually. However, manual supervision is required to extract 3D landmarks as to whether they originate from the RGB image or the 3D point cloud. Thus, there is a need for a method for extracting 3D landmarks without manual supervision. Herein, an RGB image and a 3D point cloud are used to extract 3D landmarks. The 3D point cloud is utilized as the relative distance between a LiDAR and a user. Because it cannot contain all information the user’s entire body due to disparities, it cannot generate a dense depth image that provides the boundary of user’s body. Therefore, up-sampling is performed to increase the density of the depth image generated based on the 3D point cloud; the density depends on the 3D point cloud. This paper proposes a system for extracting 3D landmarks using 3D point clouds and RGB images without manual supervision. A depth image provides the boundary of a user’s motion and is generated by using 3D point cloud and RGB image collected by a LiDAR and an RGB camera, respectively. To extract 3D landmarks automatically, an encoder–decoder model is trained with the generated depth images, and the RGB images and 3D landmarks are extracted from these images with the trained encoder model. The method of extracting 3D landmarks using RGB depth (RGBD) images was verified experimentally, and 3D landmarks were extracted to evaluate the user’s motions with RGBD images. In this manner, landmarks could be extracted according to the user’s motions, rather than by extracting them using the RGB images. The depth images generated by the proposed method were 1.832 times denser than the up-sampling-based depth images generated with bilateral filtering.

Read full abstract

Dense Depth Image Research Articles

Related Topics

Articles published on Dense Depth Image

NSVDNet: Normalized Spatial-Variant Diffusion Network for Robust Image-Guided Depth Completion

ST-DepthNet: A Spatio-Temporal Deep Network for Depth Completion Using a Single Non-Repetitive Circular Scanning Lidar

I2D-Loc: Camera localization via image to LiDAR depth flow

Collaborative 3D face alignment and head pose estimation with frontal face constraint based on RGB and sparse depth

LiDAR Depth Completion Using Color-Embedded Information via Knowledge Distillation

Robust Depth Completion with Uncertainty-Driven Loss Functions

NNNet: New Normal Guided Depth Completion From Sparse LiDAR Data and Single Color Image

AGNet: Attention Guided Sparse Depth Completion Using Convolutional Neural Networks

CodeMapping: Real-Time Dense Mapping for Sparse SLAM using Compact Scene Representations

A multi-cue guidance network for depth completion

Automatic 3D Landmark Extraction System Based on an Encoder–Decoder Using Fusion of Vision and LiDAR

Object Segmentation Ensuring Consistency Across Multi-Viewpoint Images.

Joint inpainting of depth and reflectance with visibility estimation

Dense RGB-D Map-Based Human Tracking and Activity Recognition using Skin Joints Features and Self-Organizing Map

Dense depth image synthesis via energy minimization for three-dimensional video

Detecting human falls with 3-axis accelerometer and depth sensor.

Robust curb detection with fusion of 3D-Lidar and camera data.

人物シルエットの重なりを考慮したテンプレートを用いたステレオビジョン複数人物追跡

ORIENTATION AND DENSE RECONSTRUCTION OF UNORDERED TERRESTRIAL AND AERIAL WIDE BASELINE IMAGE SETS

The Benefits of Dense Stereo for Pedestrian Detection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Dense Depth Image Research Articles

Related Topics

Articles published on Dense Depth Image

NSVDNet: Normalized Spatial-Variant Diffusion Network for Robust Image-Guided Depth Completion

ST-DepthNet: A Spatio-Temporal Deep Network for Depth Completion Using a Single Non-Repetitive Circular Scanning Lidar

I2D-Loc: Camera localization via image to LiDAR depth flow

Collaborative 3D face alignment and head pose estimation with frontal face constraint based on RGB and sparse depth

LiDAR Depth Completion Using Color-Embedded Information via Knowledge Distillation

Robust Depth Completion with Uncertainty-Driven Loss Functions

NNNet: New Normal Guided Depth Completion From Sparse LiDAR Data and Single Color Image

AGNet: Attention Guided Sparse Depth Completion Using Convolutional Neural Networks

CodeMapping: Real-Time Dense Mapping for Sparse SLAM using Compact Scene Representations

A multi-cue guidance network for depth completion

Automatic 3D Landmark Extraction System Based on an Encoder–Decoder Using Fusion of Vision and LiDAR

Object Segmentation Ensuring Consistency Across Multi-Viewpoint Images.

Joint inpainting of depth and reflectance with visibility estimation

Dense RGB-D Map-Based Human Tracking and Activity Recognition using Skin Joints Features and Self-Organizing Map

Dense depth image synthesis via energy minimization for three-dimensional video

Detecting human falls with 3-axis accelerometer and depth sensor.

Robust curb detection with fusion of 3D-Lidar and camera data.

人物シルエットの重なりを考慮したテンプレートを用いたステレオビジョン複数人物追跡

ORIENTATION AND DENSE RECONSTRUCTION OF UNORDERED TERRESTRIAL AND AERIAL WIDE BASELINE IMAGE SETS

The Benefits of Dense Stereo for Pedestrian Detection