Encoder-Decoder Structure With the Feature Pyramid for Depth Estimation From a Single Image

Mengxia Tang,Ruifang Dong,Songnan Chen,Jiangming Kan

doi:10.1109/access.2021.3055497

Mengxia Tang, Ruifang Dong + Show 2 more

Open Access

PDF Available

https://doi.org/10.1109/access.2021.3055497

Copy DOI

Export

Save

Cite

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 5	License type: CC BY 4.0

Affiliation: Beijing Forestry University

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

We address the problem of depth estimation from a single monocular image in the paper. Depth estimation from a single image is an ill-posed and inherently ambiguous problem. In the paper, we propose an encoder-decoder structure with the feature pyramid to predict the depth map from a single RGB image. More specifically, the feature pyramid is used to detect objects of different scales in the image. The encoder structure aims to extract the most representative information from the original image through a series of convolution operations and to reduce the resolution of the input image. We adopt Res2-50 as the encoder to extract important features. The decoder section uses a novel upsampling structure to improve the output resolution. Then, we also propose a novel loss function that adds gradient loss and surface normal loss to the depth loss, which can predict not only the global depth but also the depth of fuzzy edges and small objects. Additionally, we use Adam as our optimization function to optimize our network and speed up convergence. Our extensive experimental evaluation proves the efficiency and effectiveness of the method, which is competitive with previous methods on the Make3D dataset and outperforms state-of-the-art methods on the NYU Depth v2 dataset.

Highlights

Estimating the dense and accurate depth of a scene from a single RGB image is one of the fundamental problems of computer vision and essential for various applications, such as scene understanding [1]–[4], 3D modeling [5], [6], robotics [7], [8], virtual reality [9], and autonomous driving [10]
Given the training set RGB image and the corresponding depth map of the image, depth prediction can be regarded as a pixel-level regression problem; that is, the model directly learns to predict the depth corresponding to each pixel in the single image
We propose a novel method for monocular depth estimation

Summary

INTRODUCTION

Estimating the dense and accurate depth of a scene from a single RGB image is one of the fundamental problems of computer vision and essential for various applications, such as scene understanding [1]–[4], 3D modeling [5], [6], robotics [7], [8], virtual reality [9], and autonomous driving [10]. Estimating the depth from a single image is an ill-posed and inherent ambiguous problem. Previous studies have shown that depth estimation, similar to other pixel-level classification or regression tasks, can be performed using the convolutional neural networks (CNNs) model. We present a novel approach for estimating depth from a single image. M. Tang et al.: Encoder-Decoder Structure With the Feature Pyramid for Depth Estimation From a Single Image the different methods used for depth estimation in the past.

RELATED WORK

METHODS

LOSS FUNCTION

EXPERIMENTION

EVALUATION METRICS

Findings

DISCUSSION AND CONCLUSION

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Encoder-Decoder Structure With the Feature Pyramid for Depth Estimation From a Single Image

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Detection and Depth Estimation for Objects from Single Monocular Image
Ziwen Xu ... Yingmin Jia
-
Ziwen Xu, et. al.Ziwen Xu ... Yingmin Jia
24 Sep 2020
24 Sep 2020

Monocular Depth Estimation Using Encoder-Decoder Architecture and Transfer Learning from Single RGB Image
Hritam Basak ... Sagnik Ghosal
-
Hritam Basak, et. al.Hritam Basak ... Sagnik Ghosal
27 Nov 2020
27 Nov 2020

Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks
...
-
, et. al. ...
25 Apr 2016
25 Apr 2016

Improvised Filter Design for Depth Estimation from Single Monocular Images
Aniruddha Das ... Jignesh Bhavsar
-
Aniruddha Das, et. al.Aniruddha Das ... Jignesh Bhavsar
01 Jan 2009
01 Jan 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Encoder-Decoder Structure With the Feature Pyramid for Depth Estimation From a Single Image

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: IEEE Access