A Lightweight Neural Network for Monocular View Generation With Occlusion Handling.

Simon Evain, Christine Guillemot

doi:10.1109/tpami.2019.2960689

Abstract

In this article, we present a very lightweight neural network architecture, trained on stereo image pairs, which performs view synthesis from a single image. With the growing success of multi-view formats, this problem is increasingly relevant. The network returns a prediction built from disparity estimation and fills in wrongly predicted regions using an occlusion handling technique. To do so, during training, the network learns to estimate the left-right consistency structural constraint on the pair of stereo input images, so that it can replicate it at test time from a single image. The method is built upon the idea of blending two predictions: a prediction based on disparity estimation and a prediction based on direct minimization in occluded regions. The network is also able to identify these occluded areas, at training and at test time, by checking the pixelwise left-right consistency of the produced disparity maps. At test time, the approach can thus generate a left-side and a right-side view from one input image, as well as a depth map and a pixelwise confidence measure in the prediction. The method outperforms state-of-the-art approaches on the challenging KITTI dataset, both visually and metric-wise, while reducing the required number of parameters by a factor of 5 or 10 (to 6.5 M).
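
The pixelwise left-right consistency check and the blending of two predictions mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' network: it is the classical geometric check applied to two already available disparity maps, whereas the paper's network learns to reproduce the constraint from a single image. The function names, the sign convention (a left-view pixel at column x matches the right-view pixel at column x - d), the tolerance `tol`, and the hard binary mask are all assumptions made for illustration.

```python
import numpy as np

def left_right_consistency(disp_left, disp_right, tol=1.0):
    """Return a boolean map, True where the two disparity maps agree.

    Assumed convention: a left-view pixel at column x matches the
    right-view pixel at column x - disp_left[y, x].
    """
    h, w = disp_left.shape
    xs = np.tile(np.arange(w), (h, 1))
    ys = np.tile(np.arange(h)[:, None], (1, w))
    # Column in the right view that each left-view pixel maps to.
    x_right = np.rint(xs - disp_left).astype(int)
    inside = (x_right >= 0) & (x_right < w)
    x_clipped = np.clip(x_right, 0, w - 1)
    disp_right_sampled = disp_right[ys, x_clipped]
    # Pixels that leave the image or disagree are treated as occluded.
    return inside & (np.abs(disp_left - disp_right_sampled) <= tol)

def blend(pred_disparity, pred_direct, consistent):
    """Keep the disparity-based prediction where the check passes,
    and fall back to the direct prediction elsewhere (hard mask)."""
    mask = consistent[..., None].astype(pred_disparity.dtype)
    return mask * pred_disparity + (1.0 - mask) * pred_direct

if __name__ == "__main__":
    disp = np.full((4, 6), 2.0)             # toy constant disparity
    consistent = left_right_consistency(disp, disp)
    warped = np.ones((4, 6, 3))             # stand-in for disparity-based view
    direct = np.zeros((4, 6, 3))            # stand-in for direct prediction
    print(blend(warped, direct, consistent)[0, :, 0])
```

In this toy case the two leftmost columns map outside the right view, so they are flagged as inconsistent and filled from the direct prediction. A soft confidence map in place of the boolean mask would be a natural variation, in line with the pixelwise confidence measure the abstract describes.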

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence
Publication Date: Dec 19, 2019
Citations: 38

Similar Papers

Hair-GAN: Recovering 3D hair structure from a single image using generative adversarial networks
Meng Zhang ... Youyi Zheng
Visual Informatics | VOL. 3
01 Jun 2019

Deep Relation Learning for Regression and Its Application to Brain Age Estimation.
Sheng He ... Yanfang Feng
IEEE Transactions on Medical Imaging | VOL. 41
01 Sep 2022

Peeking behind objects: Layered depth prediction from a single image
Helisa Dhamo ... Federico Tombari
Pattern Recognition Letters | VOL. 125
06 May 2019

Vehicle Detection and Disparity Estimation Using Blended Stereo Images
Changxin Zhou ... Quansen Sun
IEEE Transactions on Intelligent Vehicles | VOL. 6
01 Dec 2021
