BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation.

Zhenyu Li, Junjun Jiang, Xuyang Wang, Xianming Liu

doi:10.1109/tip.2024.3416065

Abstract

Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn increasing attention. Recently, some methods have reformulated it as a classification-regression task to boost model performance, where continuous depth is estimated via a linear combination of predicted probability distributions and discrete bins. In this paper, we present a novel framework called BinsFormer, tailored for classification-regression-based depth estimation. It focuses on two crucial components of this task: 1) proper generation of adaptive bins and 2) sufficient interaction between the probability distribution and bins predictions. Specifically, we employ a Transformer decoder to generate bins, casting bin generation, in a novel way, as a direct set-to-set prediction problem. We further integrate a multi-scale decoder structure to achieve a comprehensive understanding of spatial geometry information and estimate depth maps in a coarse-to-fine manner. Moreover, an extra scene understanding query is proposed to improve estimation accuracy; it turns out that models can implicitly learn useful information from the auxiliary environment classification task. Extensive experiments on the KITTI, NYU, and SUN RGB-D datasets demonstrate that BinsFormer surpasses state-of-the-art MDE methods by prominent margins. Code and pretrained models are publicly available at https://github.com/zhyever/Monocular-Depth-Estimation-Toolbox/tree/main/configs/binsformer.
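The classification-regression formulation mentioned in the abstract can be illustrated for a single pixel with a minimal sketch. This is not the authors' implementation: the function names, the depth range (`d_min`, `d_max`), and the hand-set bin widths and logits are assumptions chosen for illustration, whereas BinsFormer predicts bins with a Transformer decoder. The sketch shows only the final step, in which depth is the probability-weighted sum of bin centers.

```python
import math

def bin_centers(widths, d_min, d_max):
    """Convert positive bin widths into bin centers covering [d_min, d_max].

    The widths are normalized to sum to 1 and then scaled to the depth range
    (d_min and d_max are illustrative assumptions, not values from the paper).
    """
    total = sum(widths)
    centers = []
    left = d_min
    for w in widths:
        span = (d_max - d_min) * w / total  # width of this bin in depth units
        centers.append(left + span / 2.0)   # midpoint of the bin
        left += span
    return centers

def softmax(logits):
    """Numerically stable softmax over a list of per-bin scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

def expected_depth(logits, centers):
    """Depth for one pixel: linear combination of bin centers and probabilities."""
    probs = softmax(logits)
    return sum(p * c for p, c in zip(probs, centers))

# Hypothetical example: three adaptive bins over a 0-8 m range.
centers = bin_centers([1.0, 1.0, 2.0], d_min=0.0, d_max=8.0)
uniform_depth = expected_depth([0.0, 0.0, 0.0], centers)  # equal probability per bin
peaked_depth = expected_depth([100.0, 0.0, 0.0], centers)  # nearly all mass on bin 0
```

With equal probabilities the estimate is the mean of the bin centers, and as the distribution concentrates on one bin the estimate approaches that bin's center. This is why the result is continuous even though the bins are discrete.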


Journal: IEEE Transactions on Image Processing: a publication of the IEEE Signal Processing Society
Publication Date: Jan 1, 2024
Citations: 11

Similar Papers

Geometry Meets Semantics for Semi-supervised Monocular Depth Estimation
Pierluigi Zama Ramirez ... Matteo Poggi
01 Jan 2019

MD-ST: Monocular Depth Estimation Based on Spatio-Temporal Correlation Features
Xuyang Meng ... Runqing Zhang
01 Jan 2020

Swin-Depth: Using Transformers and Multi-Scale Fusion for Monocular-Based Depth Estimation
Zeyu Cheng ... Yi Zhang
IEEE Sensors Journal | VOL. 21
01 Dec 2021

DepthFormer: Exploiting Long-range Correlation and Local Information for Accurate Monocular Depth Estimation
Zhenyu Li ... Junjun Jiang
Machine Intelligence Research | VOL. 20
13 Sep 2023
