Dehazing cost volume for deep multi-view stereo in scattering media with airlight and scattering coefficient estimation

Yuki Fujimura,Motoharu Sonogashira,Masaaki Iiyama

doi:10.1016/j.cviu.2021.103253

Yuki Fujimura, Motoharu Sonogashira + Show 1 more

Open Access

https://doi.org/10.1016/j.cviu.2021.103253

Copy DOI

Journal: Computer Vision and Image Understanding	Publication Date: Jul 31, 2021
Citations: 2	License type: cc-by

Affiliation: Kyoto University

Abstract

We propose a learning-based multi-view stereo (MVS) method in scattering media, such as fog or smoke, with a novel cost volume, called the dehazing cost volume. Images captured in scattering media are degraded due to light scattering and attenuation caused by suspended particles. This degradation depends on scene depth; thus, it is difficult for traditional MVS methods to evaluate photometric consistency because the depth is unknown before three-dimensional (3D) reconstruction. The dehazing cost volume can solve this chicken-and-egg problem of depth estimation and image restoration by computing the scattering effect using swept planes in the cost volume. We also propose a method of estimating scattering parameters, such as airlight, and a scattering coefficient, which are required for our dehazing cost volume. The output depth of a network with our dehazing cost volume can be regarded as a function of these parameters; thus, they are geometrically optimized with a sparse 3D point cloud obtained at a structure-from-motion step. Experimental results on synthesized hazy images indicate the effectiveness of our dehazing cost volume against the ordinary cost volume regarding scattering media. We also demonstrated the applicability of our dehazing cost volume to real foggy scenes.

Highlights

Three-dimensional (3D) reconstruction from 2D images is important in computer vision
The proposed method (MVDepthMet w/ dcv, where “dcv” denotes our dehazing cost volume) was compared with MVDepthNet [39] fine-tuned on hazy images (MVDepthNet), simple sequential methods of dehazing [20, 27] and depth estimation with MVDepthNet [39] (AOD-Net + MVDepthNet, FFA-Net + MVDepthNet), and DPSNet [18] trained on hazy images (DPSNet)
We proposed a learning-based multi-view stereo (MVS) method with a novel cost volume, called the dehazing cost volume, which enables MVS methods to be used in scattering media

Summary

Introduction

Three-dimensional (3D) reconstruction from 2D images is important in computer vision. Images captured in scattering media, such as fog or smoke, degrade due to light scattering and attenuation caused by suspended particles. Traditional 3D reconstruction methods that exploit observed pixel intensity cannot work in such environments. We propose a learning-based multi-view stereo (MVS) method in scattering media. MVS methods [13] are used (a)

Objectives

Methods

Results

Conclusion