Abstract

Monocular depth estimation is a classical but challenging task in computer vision. In recent years, Convolutional Neural Network (CNN) based models have been developed to estimate high-quality depth maps from a single image, and more recently, Transformer based models have brought further improvements. A common thread in this work is the search for a better way to process information globally, which is crucial for inferring depth relations but computationally expensive. In this paper, we combine the strengths of Transformers and CNNs and propose a novel network architecture, the Rich Global Feature Guided Network (RGFN), which extracts rich global features in both the encoder and the decoder. RGFN follows the typical encoder-decoder framework for dense prediction. A hierarchical Transformer serves as the encoder, capturing multi-scale contextual information and modeling long-range dependencies. In the decoder, Large Kernel Convolution Attention (LKCA) extracts global features at multiple scales and guides the network to progressively recover fine depth maps from low-resolution feature maps. In addition, we apply a depth-specific data augmentation method, Vertical CutDepth, to further boost performance. Experimental results on both indoor and outdoor datasets demonstrate the superiority of RGFN over other state-of-the-art models. Compared with the recent AdaBins method, RGFN improves the RMSE score by 4.66% on the KITTI dataset and 4.67% on the NYU Depth v2 dataset.
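
To make the decoder design concrete, below is a minimal PyTorch sketch of what a Large Kernel Convolution Attention block could look like, assuming a decomposition in the style of the Large Kernel Attention from Visual Attention Networks (depthwise conv, dilated depthwise conv, pointwise conv, used as a multiplicative attention map). The class name, kernel sizes, and dilation are illustrative assumptions, not the paper's exact configuration.

```python
import torch
import torch.nn as nn


class LKCA(nn.Module):
    """Illustrative Large Kernel Convolution Attention block.

    Approximates a large (roughly 21x21) receptive field by stacking a
    depthwise conv, a dilated depthwise conv, and a 1x1 conv, then using
    the result as an attention map over the input features. This mirrors
    the common large-kernel-attention decomposition; the paper's exact
    LKCA design may differ.
    """

    def __init__(self, channels: int):
        super().__init__()
        # 5x5 depthwise conv captures local structure per channel.
        self.dw_conv = nn.Conv2d(channels, channels, kernel_size=5,
                                 padding=2, groups=channels)
        # 7x7 depthwise conv with dilation 3 cheaply extends the
        # receptive field (effective kernel ~19x19).
        self.dw_dilated = nn.Conv2d(channels, channels, kernel_size=7,
                                    padding=9, dilation=3, groups=channels)
        # 1x1 conv mixes information across channels.
        self.pw_conv = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        attn = self.pw_conv(self.dw_dilated(self.dw_conv(x)))
        # Element-wise attention: reweight the input by the global map.
        return x * attn


if __name__ == "__main__":
    feats = torch.randn(1, 64, 30, 40)  # low-resolution decoder features
    print(LKCA(64)(feats).shape)        # torch.Size([1, 64, 30, 40])
```

Spatial resolution is preserved, so a block like this can be dropped into each decoder stage to inject global context before upsampling.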
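The Vertical CutDepth augmentation can likewise be sketched. The sketch below assumes the variant in which a full-height vertical strip of the (normalized) ground-truth depth map is pasted into the RGB input, so depth cues are exposed while the vertical image geometry that correlates with depth is preserved. The function name, probability, and strip-width range are hypothetical defaults, not the paper's settings.

```python
import torch


def vertical_cutdepth(image: torch.Tensor, depth: torch.Tensor,
                      p: float = 0.75, max_ratio: float = 0.5) -> torch.Tensor:
    """Illustrative Vertical CutDepth augmentation (parameters assumed).

    image: (3, H, W) RGB tensor in [0, 1]
    depth: (1, H, W) ground-truth depth tensor
    """
    if torch.rand(1).item() > p:
        return image  # apply the augmentation only with probability p
    _, h, w = image.shape
    # Pick a random full-height vertical strip to replace.
    strip_w = int(w * torch.empty(1).uniform_(0.1, max_ratio).item())
    x0 = torch.randint(0, w - strip_w + 1, (1,)).item()
    # Normalize depth to [0, 1] and replicate it to 3 channels.
    d = (depth - depth.min()) / (depth.max() - depth.min() + 1e-8)
    out = image.clone()
    out[:, :, x0:x0 + strip_w] = d.expand(3, h, w)[:, :, x0:x0 + strip_w]
    return out
```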
