Efficient Edge-Preserving Multi-View Stereo Network for Depth Estimation

Wanjuan Su,Wenbing Tao

doi:10.1609/aaai.v37i2.25330

Abstract

Over the years, learning-based multi-view stereo methods have achieved great success based on their coarse-to-fine depth estimation frameworks. However, 3D CNN-based cost volume regularization inevitably leads to over-smoothing problems at object boundaries due to its smooth properties. Moreover, discrete and sparse depth hypothesis sampling exacerbates the difficulty in recovering the depth of thin structures and object boundaries. To this end, we present an Efficient edge-Preserving multi-view stereo Network (EPNet) for practical depth estimation. To keep delicate estimation at details, a Hierarchical Edge-Preserving Residual learning (HEPR) module is proposed to progressively rectify the upsampling errors and help refine multi-scale depth estimation. After that, a Cross-view Photometric Consistency (CPC) is proposed to enhance the gradient flow for detailed structures, which further boosts the estimation accuracy. Last, we design a lightweight cascade framework and inject the above two strategies into it to achieve better efficiency and performance trade-offs. Extensive experiments show that our method achieves state-of-the-art performance with fast inference speed and low memory usage. Notably, our method tops the first place on challenging Tanks and Temples advanced dataset and ETH3D high-res benchmark among all published learning-based methods. Code will be available at https://github.com/susuwj/EPNet.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Edge-Preserving Multi-View Stereo Network for Depth Estimation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 3

Similar Papers

Adaptive depth estimation for pyramid multi-view stereo
Jie Liao ... Chunxia Xiao
Computers & Graphics | VOL. 97
Jie Liao, et. al.Jie Liao ... Chunxia Xiao
24 Apr 2021
Computers & Graphics | VOL. 97

Normal Assisted Pixel-Visibility Learning With Cost Aggregation for Multiview Stereo
Wei Tong ... Pedram Ghamisi
Intelligent Transportation Systems, IEEE Transactions on | VOL. 23
Wei Tong, et. al.Wei Tong ... Pedram Ghamisi
01 Dec 2022
Intelligent Transportation Systems, IEEE Transactions on | VOL. 23

A depth estimation framework based on unsupervised learning and cross-modal translation
Jiafeng Shen ... Ric Schleijpen
-
Jiafeng Shen, et. al.Jiafeng Shen ... Ric Schleijpen
23 Oct 2019
23 Oct 2019

Self-supervised learning of monocular depth using quantized networks
Keyu Lu ... Yonghu Zeng
Neurocomputing | VOL. 488
Keyu Lu, et. al.Keyu Lu ... Yonghu Zeng
06 Dec 2021
Neurocomputing | VOL. 488

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Edge-Preserving Multi-View Stereo Network for Depth Estimation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence