Monocular depth estimation with boundary attention mechanism and Shifted Window Adaptive Bins

Hengjia Hu, Mengnan Liang, Congcong Wang, Meng Zhao, Fan Shi, Chao Zhang, Yilin Han

doi:10.1016/j.cviu.2024.104220

Abstract

Monocular depth estimation is a classic research topic in computer vision. In recent years, the development of Convolutional Neural Networks (CNNs) has facilitated significant breakthroughs in this field. However, two challenges remain: (1) Networks struggle to fuse edge features effectively during the feature fusion stage, which results in lost structure or distorted object boundaries in the scene. (2) Classification-based studies typically depend on Transformers for global modeling, a process that often introduces substantial computational overhead. In this paper, we propose two modules to address these issues. The first is the Boundary Attention Module (BAM), which leverages the attention mechanism to enhance the network's ability to perceive object boundaries during the feature fusion stage. The second is the Shifted Window Adaptive Bins (SWAB) module, which reduces the amount of computation in global modeling and thereby mitigates the overhead of predicting adaptive bins. The proposed method is evaluated on three public datasets, NYU Depth V2, KITTI, and SUN RGB-D, and demonstrates state-of-the-art (SOTA) performance.
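To give a rough sense of why restricting attention to windows can reduce the cost of global modeling, the sketch below compares the standard complexity estimates for global self-attention and window-based self-attention as given in the Swin Transformer literature. These formulas are not taken from this abstract, and the actual cost of the SWAB module may differ. The function names and the feature-map size, channel count, and window size are hypothetical values chosen only for illustration.

```python
def global_attention_cost(h: int, w: int, c: int) -> int:
    """Estimated cost of global self-attention over an h x w feature map.

    Uses the common estimate 4*h*w*C^2 + 2*(h*w)^2*C, which grows
    quadratically with the number of tokens h*w.
    """
    n = h * w
    return 4 * n * c * c + 2 * n * n * c


def window_attention_cost(h: int, w: int, c: int, m: int) -> int:
    """Estimated cost of self-attention restricted to m x m windows.

    Uses the common estimate 4*h*w*C^2 + 2*m^2*h*w*C, which grows
    linearly with the number of tokens h*w for a fixed window size m.
    """
    n = h * w
    return 4 * n * c * c + 2 * m * m * n * c


if __name__ == "__main__":
    # Hypothetical sizes, not values reported by the paper.
    h, w, c, m = 60, 80, 128, 10
    full = global_attention_cost(h, w, c)
    windowed = window_attention_cost(h, w, c, m)
    print(f"global attention:   {full:,}")
    print(f"windowed attention: {windowed:,}")
    print(f"ratio:              {full / windowed:.1f}x")
```

Under these assumptions the two estimates coincide when a single window covers the whole feature map, and the windowed cost falls further below the global cost as the feature map grows relative to the window.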

More From: Computer Vision and Image Understanding

Similar Papers

Attention Mechanism Used in Monocular Depth Estimation: An Overview
Yundong Li ... Hanlu Fan
Applied Sciences | VOL. 13
02 Sep 2023

Towards Good Practice for CNN-Based Monocular Depth Estimation
Zhicheng Fang ... Yuhua Chen
01 Mar 2020

MD-ST: Monocular Depth Estimation Based on Spatio-Temporal Correlation Features
Xuyang Meng ... Runqing Zhang
01 Jan 2020

Monocular Depth Estimation via Self-Supervised Self-Distillation
Haifeng Hu ... Yuyang Feng
Sensors (Basel, Switzerland) | VOL. 24
24 Jun 2024
