Abstract
This article proposes a network, referred to as Multi-View Stereo TRansformer (MVSTR), for depth estimation from multi-view images. By modeling long-range dependencies and epipolar geometry, the proposed MVSTR extracts dense features with global context and 3D consistency, which are crucial for reliable matching in multi-view stereo (MVS). Specifically, to address the limited receptive field of existing CNN-based MVS methods, a global-context Transformer module is designed to establish intra-view long-range dependencies so that globally contextual features are obtained for each view. In addition, to make the features of each view 3D-consistent, a 3D-consistency Transformer module with an epipolar feature sampler is built, in which epipolar geometry is modeled to effectively facilitate cross-view interaction. Experimental results show that the proposed MVSTR achieves the best overall performance on the DTU dataset and demonstrates strong generalization on the Tanks & Temples benchmark.
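To make the two-module design concrete, the following is a minimal, illustrative sketch of how an intra-view global-context Transformer and a cross-view attention module might be composed. It is not the authors' implementation: all class names, hyperparameters, and the simplification of the 3D-consistency module to plain cross-attention (the epipolar feature sampler, which restricts source-view tokens to epipolar lines, is omitted) are assumptions made here for illustration.

```python
# Hypothetical sketch of the two Transformer modules described in the abstract.
# Not the published MVSTR code; names and settings are illustrative only.
import torch
import torch.nn as nn


class GlobalContextTransformer(nn.Module):
    """Intra-view self-attention over all spatial positions of one feature map,
    modeling the long-range dependencies that a CNN's receptive field misses."""
    def __init__(self, dim=32, heads=4, layers=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=heads,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=layers)

    def forward(self, feat):                        # feat: (B, C, H, W)
        b, c, h, w = feat.shape
        tokens = feat.flatten(2).transpose(1, 2)    # (B, H*W, C) token sequence
        tokens = self.encoder(tokens)               # global self-attention
        return tokens.transpose(1, 2).view(b, c, h, w)


class CrossViewAttention(nn.Module):
    """Cross-attention from reference-view tokens to source-view tokens.
    Stands in for the 3D-consistency module; the epipolar sampling step
    that would constrain the key/value set is omitted in this sketch."""
    def __init__(self, dim=32, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, ref_tokens, src_tokens):      # both: (B, N, C)
        out, _ = self.attn(ref_tokens, src_tokens, src_tokens)
        return out


# Toy usage: refine a reference-view feature map, then exchange
# information with one source view.
if __name__ == "__main__":
    ref = torch.randn(1, 32, 16, 20)
    src = torch.randn(1, 32, 16, 20)
    gct = GlobalContextTransformer()
    ref_g = gct(ref)                                # (1, 32, 16, 20)
    cva = CrossViewAttention()
    ref_tok = ref_g.flatten(2).transpose(1, 2)      # (1, 320, 32)
    src_tok = gct(src).flatten(2).transpose(1, 2)
    fused = cva(ref_tok, src_tok)                   # (1, 320, 32)
    print(fused.shape)
```

Note that attending over all H*W tokens costs O((HW)^2) per view, which is one motivation for restricting cross-view attention to epipolar lines as the abstract describes.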