Abstract

We present a simple yet robust monocular depth estimation technique that synthesizes a depth map from a single RGB input image by leveraging generative adversarial networks (GANs). We employ an additional sub-model, termed the refiner, to extract local depth features, which are then combined with the global scene information from the generator to improve the GAN's performance over the standard GAN architectural scheme. The generator is the first player and learns to synthesize depth images; the second player, the discriminator, classifies the generated depth; meanwhile, the third player, the refiner, enhances the final reconstructed depth. Complementing the GAN model, we apply a conditional GAN (cGAN) formulation to guide the generator in mapping the input image to its corresponding depth representation. We further incorporate a structural similarity (SSIM) loss for the generator and refiner in GAN training. Through extensive experimental validation, we confirm the performance of our strategy on the publicly available indoor NYU Depth v2 and outdoor KITTI datasets. Experimental results on the NYU Depth v2 dataset show that our proposed approach achieves the best performance, with 96.0% threshold accuracy ($\delta < 1.25^{2}$), and the second-best accuracy on all thresholds on the KITTI dataset. Our proposed method compares favorably with numerous existing monocular depth estimation strategies and demonstrates a considerable improvement in depth estimation accuracy despite its simple network architecture.
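Because the abstract describes the SSIM-based objective only at a high level, the following minimal sketch illustrates how an SSIM term is commonly combined with an adversarial term in a generator/refiner objective. It is not the authors' released code: it assumes PyTorch, uses a uniform averaging window (a common simplification of the usual Gaussian SSIM window), and the helper names ssim and generator_refiner_loss, along with the weight alpha, are illustrative assumptions rather than details taken from the paper.

    import torch
    import torch.nn.functional as F

    def ssim(x, y, c1=0.01 ** 2, c2=0.03 ** 2, window=11):
        # Local means, variances, and covariance estimated with average pooling
        # (uniform window; an assumption, since the paper's window is unspecified).
        pad = window // 2
        mu_x = F.avg_pool2d(x, window, stride=1, padding=pad)
        mu_y = F.avg_pool2d(y, window, stride=1, padding=pad)
        sigma_x = F.avg_pool2d(x * x, window, stride=1, padding=pad) - mu_x ** 2
        sigma_y = F.avg_pool2d(y * y, window, stride=1, padding=pad) - mu_y ** 2
        sigma_xy = F.avg_pool2d(x * y, window, stride=1, padding=pad) - mu_x * mu_y
        # Standard SSIM index, averaged over all pixels and channels.
        num = (2 * mu_x * mu_y + c1) * (2 * sigma_xy + c2)
        den = (mu_x ** 2 + mu_y ** 2 + c1) * (sigma_x + sigma_y + c2)
        return (num / den).mean()

    def generator_refiner_loss(fake_logits, refined_depth, true_depth, alpha=0.85):
        # Adversarial term: push the discriminator toward "real" on refined depth.
        adv = F.binary_cross_entropy_with_logits(
            fake_logits, torch.ones_like(fake_logits))
        # Structural term: 1 - SSIM penalizes structural disagreement with the
        # ground-truth depth; alpha is an illustrative weighting, not the paper's.
        structural = 1.0 - ssim(refined_depth, true_depth)
        return adv + alpha * structural

In this sketch, refined_depth would be the refiner's output given the generator's depth prediction, and fake_logits would be the discriminator's response to that refined depth conditioned on the input RGB image, following the cGAN setup described above.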
