Abstract

Existing 3D stereo networks with 4D volumes are computationally expensive but precise while 2D stereo networks are efficient but poor performance. In this paper, we present a novel group fusion aggregation (GFA) for 2D convolutions cost aggregation based on 4D volumes to reduce computational costs. Group-wise disparity aggregation block (GDAB) and group-wise channel fusion block (GCFB) are proposed to fuse geometry and context information of different group cost volumes in GFA, respectively. Further, we employ channel-dimension-first cost volume transformation and disparity-dimension-first cost volume transformation to convert 4D cost volumes into 3D tensors for GDAB and GCFB input in GFA. We evaluate our method on two popular public benchmark datasets. Experimental results from the KITTI official website show that our method can achieve similar accuracy with other 3D stereo networks (PSMNet, GCNet, GwcNet, etc.) at a low computing consumption. The ablation studies further demonstrate the facticity and reasonability of our proposed GFA.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call