Abstract

Estimating the depth map from a single RGB image is important for understanding the nature of the terrain in robot navigation and has attracted considerable attention over the past decade. Existing approaches can accurately estimate depth from a single RGB image when the environment is highly structured; the problem becomes far more challenging when the terrain is highly dynamic. We propose a fine-tuned generative adversarial network to estimate the depth map effectively for a given single RGB image. The proposed network is composed of a fine-tuned generator and a global discriminator. The encoder part of the generator takes input RGB images and depth maps and generates their joint distribution in the latent space; the decoder part of the generator then decodes the depth map from this joint distribution. The discriminator takes real and fake pairs in three different configurations and guides the generator to estimate the depth map from the given RGB image accordingly. Finally, we conducted extensive experiments on a highly dynamic environment dataset to verify the effectiveness and feasibility of the proposed approach. The proposed approach decodes the depth map from the joint distribution more effectively and accurately than existing approaches.
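The encoder-decoder generator described above can be pictured with a minimal PyTorch sketch. Everything here (layer counts, channel sizes, activations) is an illustrative assumption rather than the authors' exact architecture; the sketch only shows the data flow: the RGB image and depth map are concatenated, encoded into a joint latent code, and the decoder reconstructs a depth map from that code.

```python
# Minimal sketch of the encoder-decoder generator. All layer sizes and
# module choices are assumptions for illustration, not the paper's design.
import torch
import torch.nn as nn

class Generator(nn.Module):
    def __init__(self, latent_dim=256):
        super().__init__()
        # Encoder: takes the RGB image (3 ch) and depth map (1 ch) jointly
        # and maps them into a shared latent representation.
        self.encoder = nn.Sequential(
            nn.Conv2d(4, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(64, 128, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
            nn.Conv2d(128, latent_dim, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
        )
        # Decoder: reconstructs a one-channel depth map from the latent code.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(latent_dim, 128, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 1, 4, stride=2, padding=1), nn.Tanh(),
        )

    def forward(self, rgb, depth):
        z = self.encoder(torch.cat([rgb, depth], dim=1))  # joint latent code
        return self.decoder(z)                            # estimated depth map
```

Note that this sketch forms the latent code from an (RGB, depth) pair, as in training; how the depth map is decoded from the joint distribution at test time, when only the RGB image is available, is a detail of the full method not captured here.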

Highlights

  • Depth estimation from a single image has a long history owing to its application in computer vision and robot navigation, both for indoor and outdoor environments

  • Our approach aims to perform an accurate translation of RGB images into their corresponding depth maps with a fine-tuned generative adversarial network

  • We perform a comparative analysis of the proposed approach against the conditional generative adversarial network (cGAN)-based approach [31], BA-DualAE [29], the consistent image-to-image translation network (CITN) [28], MSDN [13], and FCN [36] on three datasets: the RealSense depth dataset [37], Cityscapes [15], and the NYU dataset [16]


Summary

INTRODUCTION

Depth estimation from a single image has a long history owing to its applications in computer vision and robot navigation, both for indoor and outdoor environments. Several recent studies have posed monocular depth estimation as a supervised learning problem to overcome the limitations of the aforementioned approaches [12,13,14]. These approaches attempt to regress the depth of each pixel in an image directly, using network models trained on a large amount of depth data. For highly dynamic and uneven terrain, however, estimating depth from a single RGB image remains difficult for unsupervised regression-based approaches, as the corresponding target changes continuously. To address depth estimation over such dynamic terrain, we propose a unified fine-tuned generator-based conditional adversarial network that translates a single RGB image into its corresponding depth map and handles the dynamic environment effectively.
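To make the conditional-adversarial setup concrete, the following is a hedged sketch of one training step. The discriminator scores (RGB, depth) pairs; the paper feeds it real and fake pairs in three different configurations, which this sketch collapses into the single standard real-vs-fake pairing. The PatchGAN-style discriminator and all sizes are assumptions, not the paper's specification.

```python
# Hedged sketch of one conditional-GAN training step: the discriminator
# scores (RGB, depth) pairs, and the generator is updated so that its
# estimated depth passes as real. The paper's three pair configurations
# are reduced here to the basic real-vs-fake pair for brevity.
import torch
import torch.nn as nn

disc = nn.Sequential(  # PatchGAN-style conditional discriminator (assumed)
    nn.Conv2d(4, 64, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(64, 1, 4, stride=1, padding=1),
)
bce = nn.BCEWithLogitsLoss()

def d_step(rgb, real_depth, fake_depth):
    # Real pair should score high, fake pair low; detach the fake depth
    # so this loss only updates the discriminator.
    real_score = disc(torch.cat([rgb, real_depth], dim=1))
    fake_score = disc(torch.cat([rgb, fake_depth.detach()], dim=1))
    return (bce(real_score, torch.ones_like(real_score))
            + bce(fake_score, torch.zeros_like(fake_score)))

def g_step(rgb, fake_depth):
    # Generator loss: make the fake (RGB, estimated depth) pair score as real.
    fake_score = disc(torch.cat([rgb, fake_depth], dim=1))
    return bce(fake_score, torch.ones_like(fake_score))
```

Conditioning the discriminator on the RGB image (rather than on the depth map alone) is what forces the generator to produce a depth map consistent with that particular input image, which is the core of the image-to-depth translation objective.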

Joint latent space
Stability of the training
RELATED WORK
PROPOSED APPROACH
NETWORK ARCHITECTURE AND TRAINING
RESULTS
ANALYSIS BASED ON REALSENSE DEPTH DATASET
ANALYSIS BASED ON NYU DATASET
CONCLUSION
