Abstract

In this paper, we propose the octave deep plane-sweeping network (OctDPSNet). OctDPSNet is a novel learning-based plane-sweeping stereo method that drastically reduces the required GPU memory and computation time while achieving state-of-the-art depth estimation accuracy. Inspired by octave convolution, we divide image features into high and low spatial frequency features, and two cost volumes are generated from them using our proposed plane-sweeping module. To reduce spatial redundancy, the resolution of the cost volume built from the low spatial frequency features is set to half that of the high spatial frequency features, which reduces both memory consumption and computational cost. After refinement, the two cost volumes are integrated into a final cost volume through our proposed pixel-wise “squeeze-and-excitation” based attention mechanism, and depth maps are estimated from this final cost volume. We evaluate the proposed model on five datasets: SUN3D, RGB-D SLAM, MVS, Scenes11, and ETH3D. Our model outperforms previous methods on all five datasets while drastically reducing memory consumption and computational cost. Our source code is available at https://github.com/matsuren/octDPSNet.
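The fusion step described in the abstract can be pictured with a small sketch. Below is a minimal, hypothetical NumPy version of a pixel-wise squeeze-and-excitation style gate: the half-resolution cost volume is upsampled, a per-pixel descriptor is "squeezed" from both volumes, and a sigmoid gate blends them. The function name and the weights `w1`, `w2` are illustrative assumptions, not the actual OctDPSNet layers.

```python
import numpy as np

def se_fuse(cost_high, cost_low, w1, w2):
    """Pixel-wise squeeze-and-excitation style fusion of two cost volumes.

    Illustrative sketch only: w1 and w2 play the role of 1x1-convolution
    weights. cost_high has shape (D, H, W); cost_low has shape (D, H//2, W//2).
    """
    up = cost_low.repeat(2, axis=1).repeat(2, axis=2)    # nearest-neighbour upsample to (D, H, W)
    stacked = np.concatenate([cost_high, up], axis=0)    # (2D, H, W)
    # Squeeze: reduce the 2D depth channels to a small per-pixel descriptor.
    hidden = np.maximum(0.0, np.einsum('cd,dhw->chw', w1, stacked))
    # Excite: one sigmoid gate per pixel deciding how to blend the two volumes.
    gate = 1.0 / (1.0 + np.exp(-np.einsum('cd,dhw->chw', w2, hidden)))
    return gate * cost_high + (1.0 - gate) * up
```

Because the gate lies in (0, 1), the fused cost at every pixel is a convex combination of the two input volumes, so the fusion can favour the detailed high-frequency volume where it is reliable and fall back on the smoother low-frequency one elsewhere.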

Highlights

  • Depth estimation is a fundamental task in the fields of computer vision and robotics, especially for autonomous navigation and autonomous driving, as it is necessary to understand the surrounding environment

  • Our motivation is to reduce the resolution of the cost volume to address the memory consumption and computational cost problems

  • To address the trade-off between computation time and accuracy, we focus on reducing spatial redundancy in a manner inspired by octave convolution (OctConv) [14]
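The OctConv idea the highlights refer to can be sketched in a few lines: a fraction of the feature channels is stored at half resolution, quartering their spatial footprint, which is where the memory savings come from. The function name and the use of `alpha` as the low-frequency channel ratio are illustrative assumptions in this NumPy sketch.

```python
import numpy as np

def octave_split(feat, alpha=0.5):
    """Split a (C, H, W) feature map into high- and low-frequency parts.

    OctConv-style sketch: the first alpha fraction of channels is
    average-pooled to half resolution (low frequency); the remaining
    channels keep full resolution (high frequency).
    """
    c, h, w = feat.shape
    c_low = int(alpha * c)
    # 2x2 average pooling: each low-frequency channel shrinks to (H/2, W/2),
    # so it occupies only a quarter of the original spatial memory.
    low = feat[:c_low].reshape(c_low, h // 2, 2, w // 2, 2).mean(axis=(2, 4))
    high = feat[c_low:]
    return high, low
```

With `alpha = 0.5`, half the channels cost only a quarter of their full-resolution memory, so the feature map as a whole shrinks to 62.5% of its original size before any cost volume is even built.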

Introduction

Depth estimation is a fundamental task in the fields of computer vision and robotics, especially for autonomous navigation and autonomous driving, as it is necessary to understand the surrounding environment. RGB cameras, RGB-D cameras, and LiDAR are commonly employed for depth estimation. RGB cameras are the most popular sensors owing to their low cost, light weight, and availability. Depth estimation from multi-view images has been studied comprehensively over a long period [1]–[5]. One approach is plane-sweeping stereo, in which multi-view images are projected onto virtual planes at several distances from the reference image plane to generate a cost volume. Depth maps are then estimated from this cost volume.
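As a concrete illustration of plane-sweeping stereo, the following NumPy sketch builds a cost volume for a two-view case: each candidate depth induces a plane homography from the reference view to the source view, the source image is sampled accordingly, and a per-pixel photometric cost is recorded; a winner-take-all depth map then picks the cheapest depth per pixel. The function name, the absolute-difference cost, and the nearest-neighbour sampling are illustrative assumptions, not the paper's learned implementation, and homography sign conventions vary between references.

```python
import numpy as np

def plane_sweep_cost_volume(ref_img, src_img, K, R, t, depths):
    """Build a plane-sweep cost volume from a reference/source image pair.

    For each candidate depth d, the source view is sampled through the
    plane-induced homography H_d = K (R + t n^T / d) K^{-1} with plane
    normal n = (0, 0, 1)^T, and compared to the reference image with an
    absolute-difference photometric cost.
    """
    h, w = ref_img.shape
    n = np.array([0.0, 0.0, 1.0])
    K_inv = np.linalg.inv(K)
    # Homogeneous pixel grid (x, y, 1) of the reference view, shape (3, H*W).
    ys, xs = np.mgrid[0:h, 0:w]
    pix = np.stack([xs, ys, np.ones_like(xs)], axis=-1).reshape(-1, 3).T
    cost = np.full((len(depths), h, w), np.inf)
    for i, d in enumerate(depths):
        H = K @ (R + np.outer(t, n) / d) @ K_inv
        warped = H @ pix.astype(float)
        u = warped[0] / warped[2]
        v = warped[1] / warped[2]
        ui = np.round(u).astype(int)   # nearest-neighbour sampling
        vi = np.round(v).astype(int)
        valid = (ui >= 0) & (ui < w) & (vi >= 0) & (vi < h)
        sampled = np.zeros(h * w)
        sampled[valid] = src_img[vi[valid], ui[valid]]
        diff = np.abs(ref_img.reshape(-1) - sampled)
        diff[~valid] = np.inf          # ignore pixels falling outside the source
        cost[i] = diff.reshape(h, w)
    # Winner-take-all: per pixel, the depth whose plane gives the lowest cost.
    depth_map = np.asarray(depths)[np.argmin(cost, axis=0)]
    return cost, depth_map
```

In the fronto-parallel case with identity rotation and a pure horizontal baseline b, the homography reduces to a shift of f·b/d pixels, which is exactly the classical disparity-depth relation; learned methods such as OctDPSNet replace the handcrafted photometric cost with deep feature matching over the same swept-plane structure.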
