ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation

Daitao Xing, Anthony Tzes, Jinglin Shen, Chiuman Ho

doi:10.1609/aaai.v37i3.25401

Abstract

Exploring mutually beneficial cross-domain cues has shown great potential for accurate self-supervised depth estimation. In this work, we revisit feature fusion between depth and semantic information and propose an efficient local adaptive attention method for geometry-aware representation enhancement. Instead of building global connections or deforming attention across the feature space without restraint, we bound the spatial interaction within a learnable region of interest. In particular, we leverage geometric cues from semantic information to learn local adaptive bounding boxes that guide unsupervised feature aggregation. The local areas exclude most irrelevant reference points from the attention space, yielding more selective feature learning and faster convergence. We extend the paradigm in a multi-head, hierarchical manner to enable information distillation at different semantic levels and to improve feature discriminability for fine-grained depth estimation. Extensive experiments on the KITTI dataset show that the proposed method establishes a new state of the art on the self-supervised monocular depth estimation task, demonstrating its effectiveness over earlier Transformer variants.
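
The idea of bounding attention within a per-query region of interest can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes a single head and a single scale, and it takes the box half-extents (`half_h`, `half_w`) as given integer inputs, whereas the abstract describes boxes that are learned from semantic cues. The function name `roi_masked_attention` and all parameter names are hypothetical.

```python
import numpy as np

def roi_masked_attention(query, key, value, half_h, half_w):
    """Attention in which each query pixel only attends to keys inside
    an axis-aligned box centred on that pixel.

    query:          (H, W, C) array, e.g. depth features (assumption)
    key, value:     (H, W, C) arrays, e.g. semantic features (assumption)
    half_h, half_w: (H, W) non-negative integer half-extents of each box;
                    supplied directly here instead of being predicted
    """
    H, W, C = query.shape
    ys, xs = np.mgrid[0:H, 0:W]

    q = query.reshape(H * W, C)
    k = key.reshape(H * W, C)
    v = value.reshape(H * W, C)

    # Query coordinates as columns, key coordinates as rows, so that
    # broadcasting yields an (N, N) query-by-key membership mask.
    qy, qx = ys.reshape(-1, 1), xs.reshape(-1, 1)
    ky, kx = ys.reshape(1, -1), xs.reshape(1, -1)
    inside = (np.abs(ky - qy) <= half_h.reshape(-1, 1)) & \
             (np.abs(kx - qx) <= half_w.reshape(-1, 1))

    # Scaled dot-product scores; keys outside the box are masked out.
    scores = q @ k.T / np.sqrt(C)
    scores = np.where(inside, scores, -np.inf)

    # Numerically stable softmax over keys. Every box contains its own
    # centre, so each row has at least one finite score.
    scores = scores - scores.max(axis=1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)

    return (weights @ v).reshape(H, W, C), weights
```

With both half-extents set to zero, every query attends only to its own position and the output equals `value`. Larger boxes admit more reference points. The dense (N, N) mask is used here only for clarity; an efficient version would sample keys inside each box rather than score every pair.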

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 26, 2023
Citations: 2

Similar Papers

Stereo depth estimation under different camera calibration and alignment errors
Xiaofeng Ding ... Xin Wang
Applied Optics | VOL. 50
23 Mar 2011

Optimizing Underwater Image Restoration and Depth Estimation with Light Field Images
Bo Xiao ... Hongwu Huang
Journal of Marine Science and Engineering | VOL. 12
02 Jun 2024

Adversarial Learning for Joint Optimization of Depth and Ego-Motion
Anjie Wang ... Shanshe Wang
IEEE Transactions on Image Processing | VOL. 29
01 Jan 2020

Accurate depth estimation from a hybrid event-RGB stereo setup
Yi-Fan Zuo ... Xia Wang
27 Sep 2021
