3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network

Dianwei Wang,Yongrui Qin,Ying Liu,Zhijie Xu,Yanhui He,Shiqian Wu,Daxiang Li

doi:10.1109/access.2019.2955995

Abstract

This paper addresses the challenge of 3D object detection from a single panoramic image under severe deformation. The advent of the two-stage approach has impelled significant progress in 3D object detection. However, most available methods only can localize region proposals by a single-scale architecture network, which are sensitive to deformation and distortion. To address this issue, we propose a multi-scale convolutional neural network (MSCNN) to estimate the 3D pose of an object. To be specific, the proposed MSCNN consists of three steps for effectively detecting the distorted object on the panoramic images. The MSCNN contains the CycleGAN network that converts rectilinear images into panoramas, a fused framework that improves both accuracy and speed for object detection, and an adversarial spatial transformer network (ASTN) that extracts the deformation features of the object on panoramic images. Additionally, we recover the 3D pose of the object using a coordinate projection and a 3D bounding box. Extensive experiments demonstrate that the proposed method can achieve a 3D detection accuracy of 38.7% in high-resolution panoramic images, which is higher than the current state-of-the-art algorithm of 5.2%. Moreover, the speed of detection is only about 0.6 seconds per image, which is six times faster than Faster R-CNN (COCO). The code will be available at https://github.com/Yanhui-He.

Highlights

The panoramic image visualization platform has enjoyed popularity in many applications, such as virtual reality, visual surveillance, autonomous vehicles and virtual interaction [1]
We propose a new method to detect the objects of 360◦ panoramic imagery using a multi-scale convolutional neural network (MSCNN)
We propose a novel fusion network for 3D object detection with a Multi-scale Convolutional Neural Network (MSCNN), which learns a distortional representation for robust 3D detection localization in panoramic images

Summary

Introduction

The panoramic image visualization platform has enjoyed popularity in many applications, such as virtual reality, visual surveillance, autonomous vehicles and virtual interaction [1]. Panoramic images are typically represented using an equirectangular projection, The associate editor coordinating the review of this manuscript and approving it for publication was Li He. which creates severe geometric distortions for objects that are further from the central horizontal line [3]. Which creates severe geometric distortions for objects that are further from the central horizontal line [3] ERA images create new challenges for computer vision and image processing as i) we lack high-quality annotated 360◦ datasets, ii) imagery is difficult to treat due to its high-resolution and iii) equirectangular projection creates severe geometric distortions for objects away from the central horizontal line [3]. Panoramic images create new challenges for object detection, which is a crucial procedure

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 8	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Recent Advances in 3D Object Detection in the Era of Deep Neural Networks: A Survey.
Mohammad Muntasir Rahman ... Jian Xue
IEEE Transactions on Image Processing | VOL. 29
Mohammad Muntasir Rahman, et. al.Mohammad Muntasir Rahman ... Jian Xue
28 Nov 2019
IEEE Transactions on Image Processing | VOL. 29

U-Select RCNN: An Effective Voxel-based 3D Object Detection Method with Feature Selection Strategy
Zhenghong Zhang ... Lin Zhao
-
Zhenghong Zhang, et. al.Zhenghong Zhang ... Lin Zhao
15 Aug 2022
15 Aug 2022

A novel lesion detection algorithm based on multi-scale input convolutional neural network model for diabetic retinopathy
...
Chinese Journal of Experimental Ophthalmology | VOL. 37
, et. al. ...
10 Aug 2019
Chinese Journal of Experimental Ophthalmology | VOL. 37

3D object detection: Learning 3D bounding boxes from scaled down 2D bounding boxes in RGB-D images
Mohammad Muntasir Rahman ... Ke Lu
Information Sciences | VOL. 476
Mohammad Muntasir Rahman, et. al.Mohammad Muntasir Rahman ... Ke Lu
04 Oct 2018
Information Sciences | VOL. 476

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

3D Object Detection Algorithm for Panoramic Images With Multi-Scale Convolutional Neural Network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access