Discrete Spherical Image Representation for CNN-Based Inclination Estimation

Yuhao Shan,Shigang Li

doi:10.1109/access.2019.2962133

Abstract

How an image is represented as the input of a convolutional neural network (CNN) is important because this input directly influences the performance of the CNN. In this paper, we investigate the representation of spherical images by focusing on the inclination estimation of a spherical camera. Unlike other approaches to CNN-based inclination estimation, a spherical image is represented as a geodesic-division-based discrete spherical image (DSI) that is obtained by sampling a sphere as uniformly as possible. The input of the CNN is a single image that consists of five parallelograms flattened from a regular icosahedron. To demonstrate the advantage of the proposed method, comparative experiments are conducted with two other spherical image representations, namely, equirectangular projection (ERP) and cubemap projection (CMP). The experimental results show that the proposed method using a geodesic-division-based discrete spherical image as the CNN input obtains the best performance-better than that of the cubemap and far superior to that of the equirectangular image. The effect of the image representations used becomes more significant as the relative inclination decreases. Moreover, comparative experiments are conducted using the state-of-the-art methods for spherical camera inclination compensation to further illustrate the superiority of the DSI representation. Consequently, the proposed method provides an important reference for the development of CNNs intended for spherical images.

Highlights

Since the precisions of both discrete spherical image (DSI) and cubemap projection (CMP) were above 95%, the convolutional neural network (CNN)-based spherical image inclination estimation task was feasible
The results show that the strategy of training from scratch achieved the worst performance, while the fine-tuned networks based on the pretrained models improved the classification accuracy of the current classification tasks
The results show that the DSI performs the best among the three image representations, followed by the CMP, while the equirectangular projection (ERP) performs the worst

Summary

Introduction

A. BACKGROUND 1) PIN-HOLE CAMERA MODEL VS. SPHERICAL CAMERA MODEL A spherical camera is a camera having the entire field-ofview (FOV). While a conventional camera that captures perspective images originates from the pin-hole camera model, a spherical camera that captures spherical images is represented by the spherical camera model. Spherical images are widely used and have been studied in the fields of medical science, such as representation of the retinal images of the eyes of humans [22], [49], geography, such as representation of the earth [43], meteorology, such as computation of atmospheric motion [42], and computer vision, such as immersive virtual reality [20], [26], [27], visual surveillance [29], augmented reality [25] and robotics [23], [24], [28].

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2020
Citations: 12	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Discrete Spherical Image Representation for CNN-Based Inclination Estimation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Discrete Spherical Image Representation for CNN‐Based Spherical Camera Pose Estimation
Le Wang ... Shigang Li
IEEJ Transactions on Electrical and Electronic Engineering | VOL. 17
Le Wang, et. al.Le Wang ... Shigang Li
21 Oct 2021
IEEJ Transactions on Electrical and Electronic Engineering | VOL. 17

Reliable Feature Matching for Spherical Images via Local Geometric Rectification and Learned Descriptor
San Jiang ... Yaxin Li
Remote Sensing | VOL. 15
San Jiang, et. al.San Jiang ... Yaxin Li
13 Oct 2023
Remote Sensing | VOL. 15

SMSIR: Spherical Measure Based Spherical Image Representation.
Gang Wu ... Xiaoyan Sun
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 30
Gang Wu, et. al.Gang Wu ... Xiaoyan Sun
01 Jan 2020
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 30

E-CNN: Accurate Spherical Camera Rotation Estimation via Uniformization of Distorted Optical Flow Fields
Dabae Kim ... Sarthak Pathak
-
Dabae Kim, et. al.Dabae Kim ... Sarthak Pathak
01 May 2019
01 May 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discrete Spherical Image Representation for CNN-Based Inclination Estimation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access