Novel view synthesis with wide-baseline stereo pairs based on local–global information

Kai Song,Lei Zhang

doi:10.1016/j.cag.2024.104139

Abstract

Novel view synthesis generates images from new views using multiple images of a scene in known views. Using wide-baseline stereo image pairs for novel view synthesis allows scenes to be rendered from varied perspectives with only two images, significantly reducing image acquisition and storage costs and improving 3D scene reconstruction efficiency. However, the large geometry difference and severe occlusion between a pair of wide-baseline stereo images often cause artifacts and holes in the novel view images. To address these issues, we propose a method that integrates both local and global information for synthesizing novel view images from wide-baseline stereo image pairs. Initially, our method aggregates cost volume with local information using Convolutional Neural Network (CNN) and employs Transformer to capture global features. This process optimizes disparity prediction for improving the depth prediction and reconstruction quality of 3D scene representations with wide-baseline stereo image pairs. Subsequently, our method uses CNN to capture local semantic information and Transformer to model long-range contextual dependencies, generating high-quality novel view images. Extensive experiments demonstrate that our method can effectively reduce artifacts and holes, thereby enhancing the synthesis quality of novel views from wide-baseline stereo image pairs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Novel view synthesis with wide-baseline stereo pairs based on local–global information

Abstract

Talk to us

Similar Papers

More From: Computers & Graphics

Lead the way for us

Similar Papers

Generating implicit object fragment datasets for machine learning
Alfonso López ... José M Fuertes
Computers & Graphics | VOL. 125
Alfonso López, et. al.Alfonso López ... José M Fuertes
01 Dec 2024
Computers & Graphics | VOL. 125

Enhancing Visual Analytics systems with guidance: A task-driven methodology
Ignacio Pérez-Messina ... Silvia Miksch
Computers & Graphics | VOL. 125
Ignacio Pérez-Messina, et. al.Ignacio Pérez-Messina ... Silvia Miksch
01 Dec 2024
Computers & Graphics | VOL. 125

Enhancing semantic mapping in text-to-image diffusion via Gather-and-Bind
Huan Fu ... Guoqing Cheng
Computers & Graphics | VOL. 125
Huan Fu, et. al.Huan Fu ... Guoqing Cheng
01 Dec 2024
Computers & Graphics | VOL. 125

Editorial Note Computers & Graphics Issue 125
Joaquim Jorge
Computers & Graphics | VOL. -
Joaquim JorgeJoaquim Jorge
01 Dec 2024
Computers & Graphics | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Novel view synthesis with wide-baseline stereo pairs based on local–global information

Abstract

Talk to us

Similar Papers

More From: Computers & Graphics