Deep Photometric Stereo Network with Multi-Scale Feature Aggregation.

Chanki Yu, Sang Wook Lee

doi:10.3390/s20216261

Abstract

We present photometric stereo algorithms that are robust to non-Lambertian reflection. They are based on a convolutional neural network that estimates the surface normals of objects with complex geometry and surface reflectance from a set of an arbitrary number of images, all taken from the same viewpoint under different directional illumination conditions. The method focuses on surface normal estimation: multi-scale feature aggregation is proposed to obtain more accurate surface normals, and max pooling is adopted to obtain an intermediate order-agnostic representation of the input images. The proposed multi-scale feature aggregation scheme, which uses feature concatenation, is easily incorporated into existing photometric stereo network architectures. Our experiments were performed on the DiLiGenT photometric stereo benchmark dataset, which consists of ten real objects, and they demonstrated that our calibrated and uncalibrated photometric stereo approaches are more accurate than the baseline methods. In particular, our uncalibrated photometric stereo outperformed the state-of-the-art method. Our work is the first to consider multi-scale feature aggregation in photometric stereo, and we show that the proposed multi-scale fusion scheme improves the accuracy of the estimated surface normals.
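The two ideas named in the abstract, order-agnostic max pooling over an arbitrary number of images and multi-scale aggregation by concatenation, can be illustrated with a minimal sketch. This is plain Python on toy feature lists, not the authors' network: the function names `max_pool_fusion` and `aggregate_scales`, the two "scales", and all feature values are assumptions made for illustration, and the order in which pooling and concatenation occur in the actual architecture is not specified here.

```python
def max_pool_fusion(per_image_features):
    """Element-wise max over a variable number of per-image feature vectors.

    Because max is commutative, the result does not depend on image order.
    """
    if not per_image_features:
        raise ValueError("need at least one feature vector")
    return [max(values) for values in zip(*per_image_features)]


def aggregate_scales(fused_by_scale):
    """Concatenate the fused feature vectors from several scales into one."""
    combined = []
    for fused in fused_by_scale:
        combined.extend(fused)
    return combined


# Toy features for three input images at two hypothetical scales.
coarse = [[0.1, 0.9], [0.4, 0.2], [0.3, 0.5]]
fine = [[1.0, 0.0, 0.2], [0.5, 0.7, 0.1], [0.2, 0.3, 0.8]]

descriptor = aggregate_scales([max_pool_fusion(coarse), max_pool_fusion(fine)])
print(descriptor)  # [0.4, 0.9, 1.0, 0.7, 0.8]
```

Reordering the images, or passing a different number of them, changes nothing about the shape of the fused vector, which is what lets a single network accept an arbitrary, unordered image set.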

Highlights

In Woodham’s work, the surface orientation of an object is determined from a set of at least three images captured by a fixed orthographic camera under different illumination directions.
We present a convolutional neural network (CNN)-based method to discover the relationship between the surface normal and a set of an arbitrary number of images taken under a photometric stereo setup.
Most of the hyper-parameter settings of the proposed normal estimation network (NENet) for calibrated and uncalibrated photometric stereo follow those of PS-FCN and SDPS-Net, respectively.

Summary

Introduction

In Woodham’s work, the surface orientation of an object is determined from a set of at least three images captured by a fixed orthographic camera under different illumination directions. Example-based photometric stereo uses reference objects, such as multiple types of spheres, with homogeneous material properties that are placed with the target objects in the same scene [34,35,36]. This approach adopts an orientation consistency cue: the same image irradiance value is observed at two different points on the surfaces of objects that have identical surface appearances and surface normals under the same illumination [34]. Deep learning algorithms have recently achieved remarkable progress in various domains, such as computer vision, speech recognition, and natural language processing. Following this trend, recent advances in calibrated and uncalibrated photometric stereo that produce high-fidelity surface normals have been achieved with deep learning, where a deep photometric stereo network learns the mapping from multiple images to the surface normal vector [40,41,42,43,44,45,46].
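For context, the classical three-image case mentioned at the start of this section can be sketched as a small linear solve. The sketch assumes a Lambertian surface with no shadows, so each intensity equals albedo times the dot product of the light direction and the normal; that is the very assumption the learned methods are designed to relax. The function names `det3` and `solve_normal` and the example lights and intensities are hypothetical, and the code handles exactly three lights (with more images, a least-squares solve would be used instead).

```python
import math


def det3(m):
    """Determinant of a 3x3 matrix given as a list of rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve_normal(light_dirs, intensities):
    """Solve L g = I for g = albedo * normal using Cramer's rule."""
    d = det3(light_dirs)
    if abs(d) < 1e-12:
        raise ValueError("light directions must be linearly independent")
    g = []
    for col in range(3):
        # Replace one column of L with the intensity vector.
        m = [row[:] for row in light_dirs]
        for r in range(3):
            m[r][col] = intensities[r]
        g.append(det3(m) / d)
    albedo = math.sqrt(sum(c * c for c in g))
    if albedo == 0.0:
        raise ValueError("zero albedo: the normal is undefined")
    normal = [c / albedo for c in g]
    return normal, albedo


# Hypothetical setup: three unit light directions, one pixel facing the
# camera (normal [0, 0, 1]) with albedo 0.5.
lights = [[0.0, 0.0, 1.0], [0.6, 0.0, 0.8], [0.0, 0.6, 0.8]]
intensities = [0.5, 0.4, 0.4]

normal, albedo = solve_normal(lights, intensities)
print(normal, albedo)
```

The requirement of at least three images follows from the three unknowns in the scaled normal; the lights must also be non-coplanar, otherwise the system is singular.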

Related Work

Fully Convolutional Neural Network with a Multi-Scale Feature Aggregation

Image Formation Model

Calibrated Photometric Stereo Network

Uncalibrated Photometric Stereo Network

Network

Experimental Results

Figure

Method

Comparison with Other Models

Conclusions

Journal: Sensors (Basel, Switzerland)
Publication Date: Nov 3, 2020
Citations: 5
License type: CC BY 4.0
