Normal Assisted Pixel-Visibility Learning With Cost Aggregation for Multiview Stereo

Wei Tong,Xiaorong Guan,Edmond Q Wu,Poly Z H Sun,Rob Law,Jian Kang,Pedram Ghamisi

doi:10.1109/tits.2022.3193421

Abstract

Multiple-View Stereo (MVS) aims to reconstruct the dense 3D representations of scenes. MVS has potential applications in the fields of autonomous driving (unstructured environment construction) and robotic navigation (visual-inertial navigation). To mitigate the error of depth estimation in low-textured or occluded regions, this work proposes a two-stage multi-view stereo network for fast and accurate depth estimation. The improvements of this work over the state of the art are as follows: 1) Sparse costs are constructed to jointly predict the initial depth map and surface normal by cost regularization, which proves that the surface normals can be estimated in this way with low memory consumption. 2) A new edge refinement block is developed to refine the coarse surface normal to obtain a fine-grained surface normal map. 3) Instead of using the general variance-based metric to equally aggregate cost, a new content-adaptive cost aggregation mechanism based on the similarity of the neighboring surface normal is designed for reliable cost aggregation. To the best of our knowledge, the proposed work is the first trainable network that leverages surface normal as guidance to capture neighboring pixel-visibility, which is an effective supplement to existing depth/normal estimation frameworks. Experimental results indicate that our method can not only achieve accurate depth estimation for scene perception but also make no concession to the real-time performance and limited memory bottleblock. Multiple-view stereo (MVS) aims to reconstruct the dense 3D representations of scenes. It is widely used in the fields of industrial measurement, autonomous driving, and robotic navigation. To mitigate the error of depth estimation in challenging scenarios, this work proposes a two-stage multi-view stereo network for fast and accurate depth estimation. Our method is the first trainable network that leverages surface normal as pixel-visibility guidance to aggregate reliable cost, which could achieve accurate depth estimation and provide the perception ability for the robot. The proposed method has great potential in the fields of 3D reconstruction, industrial measurement, and robotic navigation to estimate real-time and accurate depth with limited memory consumption.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Normal Assisted Pixel-Visibility Learning With Cost Aggregation for Multiview Stereo

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Intelligent Transportation Systems

Lead the way for us

Journal: IEEE Transactions on Intelligent Transportation Systems	Publication Date: Dec 1, 2022
Citations: 6

Similar Papers

Multistage Pixel-Visibility Learning With Cost Regularization for Multiview Stereo
Xiaorong Guan ... Shan Jiang
IEEE Transactions on Automation Science and Engineering | VOL. 20
Xiaorong Guan, et. al.Xiaorong Guan ... Shan Jiang
01 Apr 2023
IEEE Transactions on Automation Science and Engineering | VOL. 20

A Simple Baseline for Fast and Accurate Depth Estimation on Mobile Devices
Ziyu Zhang ... Yicheng Wang
-
Ziyu Zhang, et. al.Ziyu Zhang ... Yicheng Wang
01 Jun 2021
01 Jun 2021

Stereo depth estimation under different camera calibration and alignment errors
Xiaofeng Ding ... Xin Wang
Applied Optics | VOL. 50
Xiaofeng Ding, et. al.Xiaofeng Ding ... Xin Wang
23 Mar 2011
Applied Optics | VOL. 50

Accurate unsupervised monocular depth estimation for ill-posed region
Xiaofeng Wang ... Hao Qin
Frontiers in Physics | VOL. 10
Xiaofeng Wang, et. al.Xiaofeng Wang ... Hao Qin
12 Jan 2023
Frontiers in Physics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Normal Assisted Pixel-Visibility Learning With Cost Aggregation for Multiview Stereo

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Intelligent Transportation Systems