Abstract
We present See360, a versatile and efficient framework for 360° panoramic view interpolation using latent space viewpoint estimation. Most existing view rendering approaches focus only on indoor or synthetic 3D environments and render novel views of small objects. In contrast, we tackle camera-centered view synthesis as a 2D affine transformation, without point clouds or depth maps, which enables effective 360° panoramic scene exploration. Given a pair of reference images, the See360 model learns to render novel views with a proposed Multi-Scale Affine Transformer (MSAT) that enables coarse-to-fine feature rendering. We also propose a Conditional Latent space AutoEncoder (C-LAE) to achieve view interpolation at any arbitrary angle. To demonstrate the versatility of our method, we introduce four training datasets, namely UrbanCity360, Archinterior360, HungHom360 and Lab360, collected from indoor and outdoor environments for both real and synthetic rendering. Experimental results show that the proposed method is generic enough to achieve real-time rendering of arbitrary views on all four datasets. In addition, our See360 model can be applied to view synthesis in the wild: with only a short extra training time (approximately 10 minutes), it is able to render unknown real-world scenes. The superior performance of See360 opens up a promising direction for camera-centered view rendering and 360° panoramic view interpolation.
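As a rough illustration of treating view synthesis as a 2D affine transformation applied to feature maps at several scales, the PyTorch sketch below warps a set of feature maps with a shared affine matrix, coarsest scale first. It is a minimal stand-in under our own assumptions (the function names, scales and rotation-only matrix are hypothetical), not the actual MSAT implementation.

```python
# Minimal sketch (not the authors' code): warping multi-scale feature maps
# with a shared 2D affine transform, in the spirit of coarse-to-fine rendering.
import torch
import torch.nn.functional as F

def affine_warp(feat, theta):
    """Warp a feature map with a 2x3 affine matrix.

    feat:  (B, C, H, W) feature tensor
    theta: (B, 2, 3) affine matrices in normalized coordinates
    """
    grid = F.affine_grid(theta, feat.shape, align_corners=False)
    return F.grid_sample(feat, grid, align_corners=False)

def coarse_to_fine_warp(features, theta):
    """Apply the same affine transform at several scales, coarsest first
    (hypothetical stand-in for multi-scale feature rendering)."""
    return [affine_warp(f, theta) for f in sorted(features, key=lambda f: f.shape[-1])]

# Toy usage: warp feature maps by an in-plane rotation.
angle = torch.tensor(0.3)  # radians
theta = torch.stack([
    torch.stack([torch.cos(angle), -torch.sin(angle), torch.tensor(0.0)]),
    torch.stack([torch.sin(angle),  torch.cos(angle), torch.tensor(0.0)]),
]).unsqueeze(0)            # (1, 2, 3)
feats = [torch.randn(1, 64, s, s) for s in (16, 32, 64)]
warped = coarse_to_fine_warp(feats, theta)
```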
Highlights
We present See360, a versatile and efficient framework for 360° panoramic view interpolation using latent space viewpoint estimation
Our method differs from novel view rendering since our goal is to capture the 3D structure of the surroundings rather than the structure of a single object
To render a novel view at a given camera pose, See360 extends traditional GANs by introducing a Conditional Latent space AutoEncoder (C-LAE) that maps the 3D camera pose to a 2D image projection
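The sketch below shows, in a hypothetical and simplified form, how a latent code can be conditioned on a target viewing angle: the angle is encoded as (cos, sin) so that 0° and 360° coincide, and fused with the latent codes of two reference views. Layer sizes and all names are our assumptions for illustration, not the paper's C-LAE.

```python
# Hypothetical sketch of angle-conditioned latent fusion
# (illustrative only; sizes and names are assumptions, not the paper's C-LAE).
import torch
import torch.nn as nn

class AngleConditionedFusion(nn.Module):
    def __init__(self, latent_dim=256):
        super().__init__()
        # Fuse the latent codes of two reference views with the target angle.
        self.fuse = nn.Sequential(
            nn.Linear(2 * latent_dim + 2, latent_dim),
            nn.ReLU(inplace=True),
            nn.Linear(latent_dim, latent_dim),
        )

    def forward(self, z_left, z_right, angle_rad):
        # Encode the target angle as (cos, sin) so 0 and 2*pi map to the same point.
        cond = torch.stack([torch.cos(angle_rad), torch.sin(angle_rad)], dim=-1)
        z = torch.cat([z_left, z_right, cond], dim=-1)
        return self.fuse(z)  # conditioned latent, to be decoded into the novel view

# Toy usage:
fusion = AngleConditionedFusion()
z_l, z_r = torch.randn(1, 256), torch.randn(1, 256)
angle = torch.tensor([0.5])  # target viewing angle in radians
z_view = fusion(z_l, z_r, angle)
```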
Summary
We present See360, a versatile and efficient framework for 360° panoramic view interpolation using latent space viewpoint estimation. RGBD cameras can be used to capture depth for 3-DoF or 6-DoF rendering, enabling depth estimation [1], [2], semantic segmentation [3], [4], [5] and salience prediction [6], [7], [8]. In contrast with both 360° video and novel view rendering, but bridging the gap between them, our goal is to achieve camera-centered, 360° panoramic novel view interpolation. In the comparison with the ground truth, the very small differences are mainly located around edges, which indicates that the global information matches in the low-frequency domain despite some high-frequency information losses. This good pixel fidelity enables the generated images to be used for other applications, such as semantic segmentation (see Figure 1(d)).
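To make the segmentation use case concrete, the sketch below compares the per-pixel labels that an off-the-shelf segmenter assigns to a rendered view and to the corresponding ground-truth view. The choice of torchvision's pretrained DeepLabV3 (and the `weights="DEFAULT"` argument, which requires a recent torchvision) is our assumption, not the evaluation protocol of the paper.

```python
# Sketch: check whether a rendered view yields the same semantic segmentation
# as the ground-truth view, using an off-the-shelf model (torchvision DeepLabV3).
# The specific segmenter is an assumption; the paper may use a different one.
import torch
from torchvision.models.segmentation import deeplabv3_resnet50

@torch.no_grad()
def label_agreement(rendered, ground_truth):
    """rendered, ground_truth: (1, 3, H, W) ImageNet-normalized RGB tensors.
    Returns the fraction of pixels assigned the same class by the segmenter."""
    model = deeplabv3_resnet50(weights="DEFAULT").eval()
    pred_r = model(rendered)["out"].argmax(dim=1)
    pred_g = model(ground_truth)["out"].argmax(dim=1)
    return (pred_r == pred_g).float().mean().item()

# Toy usage with random tensors (replace with real, normalized images):
x_rendered = torch.randn(1, 3, 256, 256)
x_real = torch.randn(1, 3, 256, 256)
print(f"pixel-wise label agreement: {label_agreement(x_rendered, x_real):.3f}")
```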