Abstract

Monocular depth estimation, a fundamental task in computer vision, plays a crucial role in three-dimensional (3D) scene understanding and perception. Deep learning methods typically recover monocular depth maps by continuous regression, minimizing the error between the ground-truth and predicted depth. However, fine depth features may not be fully captured through layer-by-layer encoding, which tends to produce depth maps with low spatial resolution and insufficient detail; moreover, such regression usually converges slowly and yields unsatisfactory results. To tackle these issues, we propose a novel model, the context-based ordinal regression network (CORNet), which reconstructs monocular depth maps in an ordinal regression manner using context information. First, we put forward a novel context-based encoder with a feature transformation (FT) module that learns context information and fine details from the input and outputs multi-scale feature maps. Then, we design a boundary enhancement module (BEM) with a spatial attention mechanism after each feature fusion operation, which captures boundary features in the scene to enhance depth at object borders. Finally, a feature optimization module (FOM) fuses and optimizes the multi-scale features and boundary features to strengthen depth learning. In addition, we introduce an ordinal weighted inference scheme that predicts depth maps from bin probabilities and discretization values. Experiments on two challenging datasets, KITTI and NYU Depth V2, demonstrate that the proposed CORNet estimates monocular depth maps effectively and outperforms existing methods in capturing geometric features.
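To make the ordinal weighted inference concrete, the following is a minimal NumPy sketch of how a continuous depth map could be recovered from per-pixel bin probabilities and discretization values. The spacing-increasing discretization, the depth range [1, 80] m, the bin count K = 80, and the function names are assumptions in the style of prior ordinal-regression depth work (e.g., DORN); the abstract does not specify these details.

```python
import numpy as np

def sid_bins(alpha=1.0, beta=80.0, K=80):
    """Spacing-increasing discretization of the depth range [alpha, beta].

    The range and bin count are assumed (KITTI-style values); the paper's
    actual discretization is not given in the abstract.
    """
    k = np.arange(K + 1)
    edges = np.exp(np.log(alpha) + k * np.log(beta / alpha) / K)
    # Represent each bin by the geometric mean of its edges.
    return np.sqrt(edges[:-1] * edges[1:])  # shape (K,)

def ordinal_weighted_depth(probs, bin_values):
    """Ordinal weighted inference: expectation over discretized depths.

    probs:      (H, W, K) per-pixel probabilities over the K depth bins
                (e.g., a softmax over the network's ordinal logits).
    bin_values: (K,) representative depth of each bin.
    Returns a continuous (H, W) depth map; weighting by probability
    avoids the quantization artifacts of a hard argmax over bins.
    """
    return np.tensordot(probs, bin_values, axes=([-1], [0]))

# Toy usage with random "predictions" for a 4x4 image and 80 bins.
rng = np.random.default_rng(0)
logits = rng.normal(size=(4, 4, 80))
probs = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)
depth = ordinal_weighted_depth(probs, sid_bins())
print(depth.shape)  # (4, 4)
```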
