Rethinking 3D cost aggregation in stereo matching

Wanshui Gan,Wenhao Wu,Shifeng Chen,Yuxiang Zhao,Pak Kin Wong

doi:10.1016/j.patrec.2023.02.011

Abstract

In the stereo matching task, the 3D convolution network can effectively aggregate the cost volume with the strong representation ability to model the spatial and depth dimensions but with the disadvantage of a high computational cost. In this letter, we revisit the 3D convolution network and its common variant, and then propose the Depth Shift Module (DSM) to model the cost volume in the depth dimension which could imitate the 3D convolution function with the computational complexity of the 2D convolution. The proposed DSM is easy to extend to present 3D cost aggregation methods in stereo matching with less inference time, lower computational complexity, and minor precision loss. Moreover, a novel compact but efficient stereo matching framework named HybridNet is proposed. This framework can hybridize the 2D convolution layer with the proposed DSM to effectively aggregate the cost volume. The proposed HybridNet achieves a better trade-off between the performance, computational complexity, and model size (e.g., 30% less than the size of AANet and 25% less than the size of PSMNet) in public open-source datasets (e.g., Scene Flow and KITTI Stereo 2015). The relevant code is available at https://github.com/GANWANSHUI/HybridNet.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Rethinking 3D cost aggregation in stereo matching

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Feb 7, 2023
Citations: 6

Similar Papers

Hybrid network model based on 3D convolutional neural network and scalable graph convolutional network for hyperspectral image classification
Xili Wang ... Zhengyin Liang
IET Image Processing | VOL. 17
Xili Wang, et. al.Xili Wang ... Zhengyin Liang
25 Sep 2022
IET Image Processing | VOL. 17

Mixed 2D and 3D convolutional network with multi-scale context for lesion segmentation in breast DCE-MRI
Hongyu Wang ... Baoying Chen
Biomedical Signal Processing and Control | VOL. 68
Hongyu Wang, et. al.Hongyu Wang ... Baoying Chen
05 Apr 2021
Biomedical Signal Processing and Control | VOL. 68

Fast Multi-Scale Residual Fusion Network for Stereo Matching
Zijing Huang ... Wangduo Xie
-
Zijing Huang, et. al.Zijing Huang ... Wangduo Xie
05 Jul 2021
05 Jul 2021

Accurate and Efficient Stereo Matching via Attention Concatenation Volume.
Gangwei Xu ... Xin Yang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 46
Gangwei Xu, et. al.Gangwei Xu ... Xin Yang
01 Apr 2024
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 46

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Rethinking 3D cost aggregation in stereo matching

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters