Even after many years of research, stereo matching remains a challenging task in photogrammetry and computer vision. Recent work has made great progress by formulating dense stereo matching as a pixel-wise learning task resolved with a deep convolutional neural network (CNN). However, most estimation methods, both traditional and deep-learning-based, still struggle with challenging real-world scenarios, especially those involving large depth discontinuities and low-texture areas. To tackle these problems, we investigate a recently proposed end-to-end disparity learning network, DispNet (Mayer et al., 2015), and improve it to yield better results in these problematic areas. The improvements comprise three major contributions. First, we use dilated convolutions to develop a context pyramidal feature extraction module. A dilated convolution expands the receptive field when extracting features and aggregates more contextual information, which makes our network more robust in weakly textured areas. Second, we construct the matching cost volume with patch-based correlation to handle larger disparities; we also modify the basic encoder-decoder module to regress detailed disparity images at full resolution. Third, instead of relying on post-processing steps to impose smoothness in the presence of depth discontinuities, we incorporate disparity gradient information into the loss function as a gradient regularizer, preserving local structural details in areas with large depth discontinuities. We evaluate our model in terms of end-point error on several challenging stereo datasets, including Scene Flow, Sintel, and KITTI. Experimental results demonstrate that our model reduces estimation error compared with DispNet on most datasets (e.g., a 46% improvement on Sintel) and produces disparity maps that better preserve structure. Moreover, our approach achieves competitive performance compared with other methods.
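The receptive-field expansion that motivates the dilated-convolution module can be illustrated with a minimal 1-D sketch (this is illustrative NumPy code, not the paper's network; the function name and shapes are assumptions). With dilation d and kernel size k, each output taps a span of d*(k-1)+1 inputs, so stacking dilations aggregates context without extra parameters.

```python
import numpy as np

def dilated_conv1d(x, w, dilation):
    # 'valid' 1-D convolution with gaps of `dilation` between kernel taps.
    # A kernel of size k then covers a receptive field of dilation*(k-1)+1
    # input samples, which is how dilation enlarges context cheaply.
    k = len(w)
    span = dilation * (k - 1) + 1
    out = np.empty(len(x) - span + 1)
    for i in range(len(out)):
        out[i] = sum(w[j] * x[i + j * dilation] for j in range(k))
    return out

x = np.arange(16, dtype=float)
w = np.ones(3)
# Same 3-tap kernel, growing receptive field: 3, 5, 9 samples.
outs = {d: dilated_conv1d(x, w, d) for d in (1, 2, 4)}
```

A context pyramid in this spirit would run several such branches with different dilation rates in parallel and concatenate their feature maps.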
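The patch-based correlation used to build the matching cost volume can be sketched as follows (a naive NumPy illustration under assumed names and a single-channel feature map, not the paper's implementation): for each candidate disparity d, a patch around each left-image position is correlated with the patch shifted d pixels in the right image.

```python
import numpy as np

def correlation_volume(left, right, max_disp, patch=3):
    # Naive patch-based correlation cost volume.
    # cost[d, i, j] = mean of elementwise product between the patch at (i, j)
    # in `left` and the patch at (i, j - d) in `right` (edge-padded).
    h, w = left.shape
    r = patch // 2
    cost = np.zeros((max_disp + 1, h, w))
    lp = np.pad(left, r, mode='edge')
    rp = np.pad(right, r, mode='edge')
    for d in range(max_disp + 1):
        for i in range(h):
            for j in range(w):
                jj = max(j - d, 0)  # clamp shifted column at the image border
                a = lp[i:i + patch, j:j + patch]
                b = rp[i:i + patch, jj:jj + patch]
                cost[d, i, j] = (a * b).mean()
    return cost
```

Correlating patches rather than single feature vectors makes the matching signal less ambiguous, which is what allows larger disparity ranges to be handled; a real network would compute this over learned multi-channel features with vectorized shifts.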
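One plausible form of the gradient regularizer described above is an L1 penalty on the difference between predicted and ground-truth disparity gradients, added to the plain disparity error (a hedged sketch: the exact loss, weighting `lam`, and function name here are assumptions, not the paper's formulation).

```python
import numpy as np

def gradient_regularized_loss(pred, gt, lam=0.5):
    # Mean absolute disparity error ...
    l1 = np.abs(pred - gt).mean()
    # ... plus an L1 penalty on the mismatch of horizontal and vertical
    # finite-difference gradients, so sharp (correct) depth edges are kept
    # while spurious gradient differences are penalized.
    gx = np.abs(np.diff(pred, axis=1) - np.diff(gt, axis=1)).mean()
    gy = np.abs(np.diff(pred, axis=0) - np.diff(gt, axis=0)).mean()
    return l1 + lam * (gx + gy)
```

Note that a constant disparity offset incurs only the L1 term, while a smoothed-over depth edge is additionally penalized through the gradient terms; this is how such a regularizer preserves local structure without post-processing.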