Image-Based Rendering for Large-Scale Outdoor Scenes With Fusion of Monocular and Multi-View Stereo Depth

Shaohua Liu, Xiaona Zhang, Tianlu Mao, Shuang Liu, Jing Liu, Zhaoxin Li, Minghao Li

doi:10.1109/access.2020.3004431

Abstract

Image-based rendering (IBR) attempts to synthesize novel views using a set of observed images. Some IBR approaches (such as light fields) have yielded impressive high-quality results on small-scale scenes with dense photo capture. However, available wide-baseline IBR methods are still restricted by the low geometric accuracy and completeness of multi-view stereo (MVS) reconstruction on low-textured and non-Lambertian surfaces. The issues become more significant in large-scale outdoor scenes due to challenging scene content, e.g., buildings, trees, and sky. To address these problems, we present a novel IBR algorithm that consists of two key components. First, we propose a novel depth refinement method that combines MVS depth maps with monocular depth maps predicted via deep learning. A lookup table remap is proposed for converting the scale of the monocular depths to be consistent with the scale of the MVS depths. Then, the rescaled monocular depth is used as the constraint in the minimum spanning tree (MST)-based nonlocal filter to refine the per-view MVS depth. Second, we present an efficient shape-preserving warping algorithm that uses superpixels to generate the warped images and blend expected novel views of scenes. The proposed method has been evaluated on public MVS and view synthesis datasets, as well as newly captured large-scale outdoor datasets. In comparison with state-of-the-art methods, the experimental results demonstrated that the proposed method can obtain more complete and reliable depth maps for the challenging large-scale outdoor scenes, thereby resulting in more promising novel view synthesis.

Highlights

With the increasing demand for immersive 3D content, many view synthesis methods [1]–[5] for providing realistic interactive virtual navigation have been proposed.
To improve the quality of depth estimation and view synthesis for large-scale outdoor scenes, in this work, we propose an image-based rendering (IBR) method that is based on fusion of monocular and multi-view stereo (MVS) depth.
Since the MVS depth and the monocular depth are from distributions that differ substantially in terms of scale, we present a novel layerwise mapping between the monocular depth and the MVS depth via a lookup table.
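The layerwise lookup-table mapping mentioned above can be illustrated with a minimal sketch. The details here are assumptions, not the authors' construction: layers are taken as equal-count bins over the sorted monocular depths, each layer is summarized by a pair of medians, and values between layers are interpolated linearly. The names `build_depth_lut` and `remap_depth` are hypothetical.

```python
from bisect import bisect_right
from statistics import median


def build_depth_lut(mono, mvs, layers=4):
    """Build (mono_key, mvs_value) control points from paired depth samples.

    mono, mvs: flat sequences of equal length. An mvs entry of None marks a
    pixel where MVS produced no depth; such pixels are skipped.
    Assumption: layers are equal-count bins over sorted monocular depth.
    """
    pairs = sorted((m, d) for m, d in zip(mono, mvs) if d is not None)
    if len(pairs) < layers:
        raise ValueError("not enough valid MVS samples for the requested layers")
    lut = []
    for i in range(layers):
        chunk = pairs[i * len(pairs) // layers:(i + 1) * len(pairs) // layers]
        # One control point per layer: median monocular depth -> median MVS depth.
        lut.append((median(m for m, _ in chunk), median(d for _, d in chunk)))
    return lut


def remap_depth(value, lut):
    """Map one monocular depth to the MVS scale by piecewise-linear lookup."""
    keys = [k for k, _ in lut]
    j = bisect_right(keys, value)
    if j == 0:
        return lut[0][1]      # below the first layer: clamp
    if j == len(lut):
        return lut[-1][1]     # above the last layer: clamp
    (k0, v0), (k1, v1) = lut[j - 1], lut[j]
    t = (value - k0) / (k1 - k0)
    return v0 + t * (v1 - v0)
```

Clamping outside the first and last layers is a simplification chosen for this sketch; extrapolating from the outermost layers would be an equally plausible choice for very near or very far content.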

Summary

INTRODUCTION

With the increasing demand for immersive 3D content, many view synthesis methods [1]–[5] for providing realistic interactive virtual navigation have been proposed. To improve the quality of depth estimation and view synthesis for large-scale outdoor scenes, in this work, we propose an IBR method that is based on fusion of monocular and MVS depth. Our main contributions are summarized as follows: 1) A lookup-table-based strategy that remaps the monocular depth to the scale of the MVS depth; 2) An MST-based algorithm for fusing the monocular depth and the MVS depth, which can fill in irregularities and large holes of MVS depth maps while preserving geometric details; 3) A complete pipeline for image-based navigation of outdoor scenes, which includes a refinement method for depth estimation and a superpixel-based shape-preserving warp for view synthesis.
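To make the second contribution more concrete, the following is a rough sketch of a generic MST-based nonlocal filter applied to hole filling. It is not the authors' algorithm. It assumes the rescaled monocular depth serves as the guide for the tree edge weights, uses a 4-connected pixel grid, and aggregates with the standard two-pass tree recursion using similarity exp(-w / sigma). The function name `mst_nonlocal_fill` and the parameter `sigma` are hypothetical.

```python
import math


def mst_nonlocal_fill(depth, guide, sigma=0.1):
    """Fill and smooth `depth` (None = hole) along an MST built on `guide`.

    depth, guide: 2D lists of equal shape. Assumption: edge weights are
    absolute guide differences, so support spreads where the guide is smooth.
    """
    h, w = len(depth), len(depth[0])
    n = h * w
    g = [guide[i // w][i % w] for i in range(n)]

    # 4-connected grid edges weighted by guide difference.
    edges = []
    for i in range(n):
        if i % w + 1 < w:
            edges.append((abs(g[i] - g[i + 1]), i, i + 1))
        if i + w < n:
            edges.append((abs(g[i] - g[i + w]), i, i + w))
    edges.sort()

    # Kruskal's algorithm with union-find (path halving).
    root = list(range(n))

    def find(x):
        while root[x] != x:
            root[x] = root[root[x]]
            x = root[x]
        return x

    adj = [[] for _ in range(n)]
    for wt, a, b in edges:
        ra, rb = find(a), find(b)
        if ra != rb:
            root[ra] = rb
            s = math.exp(-wt / sigma)  # similarity across this tree edge
            adj[a].append((b, s))
            adj[b].append((a, s))

    # Root the tree at pixel 0; `order` lists parents before children (BFS).
    parent, sim, order = [-1] * n, [0.0] * n, [0]
    seen = [False] * n
    seen[0] = True
    head = 0
    while head < len(order):
        v = order[head]
        head += 1
        for u, s in adj[v]:
            if not seen[u]:
                seen[u] = True
                parent[u], sim[u] = v, s
                order.append(u)

    # Aggregate depth (numerator) and validity (denominator) over the tree.
    flat = [depth[i // w][i % w] for i in range(n)]
    num = [0.0 if d is None else float(d) for d in flat]
    den = [0.0 if d is None else 1.0 for d in flat]
    for v in reversed(order[1:]):      # leaf-to-root pass
        num[parent[v]] += sim[v] * num[v]
        den[parent[v]] += sim[v] * den[v]
    for v in order[1:]:                # root-to-leaf pass
        k = 1.0 - sim[v] ** 2
        num[v] = sim[v] * num[parent[v]] + k * num[v]
        den[v] = sim[v] * den[parent[v]] + k * den[v]

    return [[num[r * w + c] / den[r * w + c] if den[r * w + c] > 0 else None
             for c in range(w)] for r in range(h)]
```

In this sketch a hole inherits depth mainly from valid pixels that are connected to it through low-weight tree edges, which is why a sharp step in the guide keeps foreground and background from mixing. Valid pixels are smoothed as well rather than kept fixed; a practical pipeline might blend the filtered value with the original MVS depth.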

RELATED WORK

DEPTH ESTIMATION

Multi-View 3D Reconstruction

WARPING AND RENDERING

RESULTS AND COMPARISONS

EVALUATION OF THE DEPTH REFINEMENT RESULTS

CONCLUSIONS


Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 1	License type: CC BY 4.0
