Dual attention-guided self-adaptive aware cascade network for multi-view stereo and stereo matching

Na Cheng, Hongyu Wang, Zhixuan Sun, Xuanbing Zhu, Hongye Xie

doi:10.1109/access.2023.3273903

Abstract

Multi-view stereo and stereo matching have been studied extensively for 3D reconstruction, yet relatively few methods have been proposed for large-scale environments. One of the main reasons is the difficulty of producing high-resolution depth/disparity maps. In this paper, we propose a dual attention-guided self-adaptive aware cascade network (DAscNet) that achieves state-of-the-art results in generating high-resolution depth/disparity maps of complex scenes by applying a cascade inference strategy to a set of input views. A pyramid cost volume fusion and a self-adaptive cost volume cascade are built upon a dual attention-guided context multi-scale feature extraction, which encodes geometric, spatial, and contextual information at gradually finer scales to achieve a robust structural representation for predictions. The dual attention-guided context multi-scale feature extraction consists of two distinct modules, both based on the attention mechanism. In the pyramid cost volume fusion, an inter-cost attention aggregation module fuses multiple low-resolution dense cost volumes to achieve a robust structural representation for initial predictions. In the self-adaptive cost volume cascade, a changeable depth/disparity range estimation module adjusts the depth/disparity search range interval of each stage based on the prediction information from the previous stage. This module drives the network to gradually resolve complicated matching ambiguities and improves the accuracy of the predicted depth/disparity search range interval. Experiments on two publicly available datasets, the Tanks and Temples dataset and the DTU dataset, show that DAscNet outperforms prior work. The effectiveness of the proposed method is further supported by statistics on accuracy, runtime, and GPU memory compared against other representative methods.
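The abstract does not say how the changeable depth/disparity range estimation module computes its new search range, so the following is only a minimal sketch of the general idea: narrowing the per-pixel search interval of the next stage around the previous stage's prediction. It assumes the previous stage yields a probability distribution over its depth hypotheses and uses the spread of that distribution as an uncertainty proxy, which is a common choice in cascade cost volume methods but not necessarily what DAscNet does. The function name `refine_search_range` and the parameters `num_next`, `k`, and `min_half_width` are hypothetical.

```python
import math


def refine_search_range(hypotheses, probabilities, num_next, k=1.5, min_half_width=1e-3):
    """Return next-stage depth (or disparity) hypotheses for one pixel.

    hypotheses:     candidates evaluated by the previous stage
    probabilities:  matching probabilities over those candidates (sum to 1)
    num_next, k, min_half_width: hypothetical tuning parameters (assumptions)
    """
    if len(hypotheses) != len(probabilities):
        raise ValueError("hypotheses and probabilities must have equal length")
    if num_next < 2:
        raise ValueError("num_next must be at least 2")

    # Expected depth under the previous stage's probability distribution.
    mean = sum(p * d for p, d in zip(probabilities, hypotheses))
    # Variance of that distribution, used here as an uncertainty estimate.
    var = sum(p * (d - mean) ** 2 for p, d in zip(probabilities, hypotheses))

    # Confident pixels get a narrow interval, ambiguous pixels a wide one.
    half_width = max(k * math.sqrt(var), min_half_width)
    step = 2.0 * half_width / (num_next - 1)
    return [mean - half_width + i * step for i in range(num_next)]


# Usage: the same coarse hypotheses with a peaked and a flat distribution.
coarse = [400.0, 500.0, 600.0, 700.0, 800.0]
confident = [0.0, 0.1, 0.8, 0.1, 0.0]
ambiguous = [0.2, 0.2, 0.2, 0.2, 0.2]

narrow = refine_search_range(coarse, confident, num_next=5)
wide = refine_search_range(coarse, ambiguous, num_next=5)
print(narrow[-1] - narrow[0], wide[-1] - wide[0])
```

In this sketch both intervals are centered on the same expected depth, but the peaked distribution produces a much tighter interval than the flat one, so later stages spend their hypotheses where the earlier stage was already confident.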


Journal: IEEE Access
Publication Date: Jan 1, 2024
Citations: 2
License type: CC BY-NC-ND 4.0


Similar Papers

Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching
Xiaodong Gu ... Zuozhuo Dai
01 Jun 2020

A-SATMVSNet: An attention-aware multi-view stereo matching network based on satellite imagery
Li Lin ... Xiongfei Liu
Frontiers in Earth Science | VOL. 11
13 Apr 2023

Stochastic Optimization Based 3D Dense Reconstruction from Multiple Views with High Accuracy and Completeness
...
Journal of Information Science and Engineering | VOL. 31
01 Jan 2015

LNMVSNet: A Low-Noise Multi-View Stereo Depth Inference Method for 3D Reconstruction
Weiming Luo ... Zongqing Lu
Sensors | VOL. 24
09 Apr 2024
