Adversarial Learning for Depth and Viewpoint Estimation From a Single Image

Saddam Abdulwahab, Sylvie Chambon, Domenec Puig, Mohammed Jabreel, Miguel Angel Garcia, Hatem A Rashwan

doi:10.1109/tcsvt.2020.2973068

Abstract

Estimating a depth map and, at the same time, predicting the 3D pose of an object from a single 2D color image is a very challenging task. Depth estimation is typically performed through stereo vision, which involves several time-consuming stages such as epipolar geometry estimation, rectification and matching. When stereo vision is not useful or applicable, depth relations can instead be inferred from a single image, as studied in this paper. More precisely, deep learning is applied to estimate a depth map from a single image, and that map is then used to predict the 3D pose of the main object depicted in the image. The proposed model consists of two successive neural networks. The first is a Generative Adversarial Network (GAN) that estimates a dense depth map from the given color image. The second is a Convolutional Neural Network (CNN) that predicts the 3D pose from the generated depth map through regression. The main difficulty in jointly estimating depth maps and 3D poses with deep networks is the lack of training data with both depth and viewpoint annotations. This work adopts a cross-domain training procedure in which 3D CAD models corresponding to objects appearing in real images are used to render depth images from different viewpoints. These rendered images then guide the GAN in learning the mapping from the image domain to the depth domain. By exploiting the dataset as a source of training data, the proposed model outperforms state-of-the-art models on the PASCAL 3D+ dataset. The code of the proposed model is publicly available at https://github.com/SaddamAbdulrhman/Depth-and-Viewpoint-Estimation/tree/master.
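The adversarial training of the first network can be illustrated with a minimal sketch of the standard GAN objective. This is not the authors' implementation: the function names, the scalar discriminator scores, and the optional L1 reconstruction term with its weight `l1_weight` are all assumptions chosen for illustration (an L1 term is a common choice in image-to-image GANs, but the abstract does not state which losses are used).

```python
import math


def bce(p, target, eps=1e-12):
    """Binary cross-entropy of one probability p against a 0/1 target."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))


def discriminator_loss(d_real, d_fake):
    """The discriminator should score a rendered depth map as 1
    and a generated depth map as 0."""
    return bce(d_real, 1.0) + bce(d_fake, 0.0)


def generator_loss(d_fake, pred_depth, target_depth, l1_weight=0.0):
    """The generator tries to make the discriminator output 1 on its depth map.

    The L1 term is an assumed extra (hypothetical weight), not confirmed
    by the source; with l1_weight=0.0 this is the plain adversarial loss.
    """
    l1 = sum(abs(p - t) for p, t in zip(pred_depth, target_depth)) / len(pred_depth)
    return bce(d_fake, 1.0) + l1_weight * l1


if __name__ == "__main__":
    # Toy discriminator scores and a flattened 2-pixel "depth map".
    print(discriminator_loss(d_real=0.9, d_fake=0.1))
    print(generator_loss(0.1, [1.0, 2.0], [1.0, 4.0], l1_weight=10.0))
```

In this sketch, a discriminator that separates rendered from generated depth maps well obtains a lower loss than one that outputs 0.5 for both, while the generator's loss falls as its depth maps become harder to tell apart from rendered ones.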

Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication Date: Sep 1, 2020
Citations: 10

Similar Papers

Convolutional Neural Network based Age Estimation from Facial Image and Depth Prediction from Single Image
01 Jan 2015

Depth-aware salient object segmentation
Le Vu Ha ... Tran Hoang Tung
VNU Journal of Science: Computer Science and Communication Engineering | VOL. 36
07 Oct 2020

Depth Estimation from Single Hazy Images with 2-Phase Training
Laksmita Rahadianti ... Fumihiko Sakaue
17 Oct 2020

Depth estimation from a single image in pedestrian candidate generation
Yali Guo ... Huiqi Li
01 Jun 2016
