Guided Depth Map Super-Resolution Using Recumbent Y Network

Tao Li,Hongwei Lin,Xiucheng Dong

doi:10.1109/access.2020.3007667

Tao Li, Hongwei Lin + Show 1 more

Open Access

PDF Available

https://doi.org/10.1109/access.2020.3007667

Copy DOI

Export

Save

Cite

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 10	License type: CC BY 4.0

Affiliation: Xihua University, Minzu University of China

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Low spatial resolution is a well-known problem for depth maps captured by low-cost consumer depth cameras. Depth map super-resolution (SR) can be used to enhance the resolution and improve the quality of depth maps. In this paper, we propose a recumbent Y network (RYNet) to integrate the depth information and intensity information for depth map SR. Specifically, we introduce two weight-shared encoders to respectively learn multi-scale depth and intensity features, and a single decoder to gradually fuse depth information and intensity information for reconstruction. We also design a residual channel attention based atrous spatial pyramid pooling structure to further enrich the feature's scale diversity and exploit the correlations between multi-scale feature channels. Furthermore, the violations of co-occurrence assumption between depth discontinuities and intensity edges will generate texture-transfer and depth-bleeding artifacts. Thus, we propose a spatial attention mechanism to mitigate the artifacts by adaptively learning the spatial relevance between intensity features and depth features and reweighting the intensity features before fusion. Experimental results demonstrate the superiority of the proposed RYNet over several state-of-the-art depth map SR methods.

Highlights

Depth map has many applications in practice, such as autonomous driving, virtual reality, 3D reconstruction
The training dataset consisting of 82 RGB-D images is provided by [4]
The HR depth maps and HR intensity images are cropped into 96 × 96 image patches for scaling factors 2× and 4×, and 128 × 128 image patches for scaling factors 8× and 16 ×, respectively

Summary

Introduction

Depth map has many applications in practice, such as autonomous driving, virtual reality, 3D reconstruction. Recent consumer depth cameras have provided a convenient way to acquire depth maps. The depth maps captured by these cameras usually suffer from low spatial resolution. The resolution of the depth map taken by Kinect v2 is only 512 × 424. Depth map SR aims to reconstruct a high-resolution (HR) depth map from its low-resolution (LR) counterpart. It has inherent ill-posedness, since there may exist multiple HR depth maps that can produce an identical LR depth map after degradation. Numerous depth map SR methods have been proposed to alleviate the ill-posedness, including filter-based methods, optimization-based methods, and learning-based methods.

Methods

Results

Conclusion