Abstract
In order to accurately estimate disparities in textureless and slim regions, spatial pyramid pooling and stacked 3D CNN, which can capture global context information, are widely used in state-of-the-art stereo matching algorithms. Unfortunately, the computational complexity and high memory consumption make these methods not friendly to real-time applications such as autonomous driving and augmented realities. In order to balance the real-time performance and accuracy, we design lightweight multi-scale convolutional neural network for real-time stereo matching. First, Lightweight multi-scale 2D and 3D CNN modules are proposed for feature extraction and initial disparity computation respectively. Both of above modules only run on a low resolution to further reduce the amount of calculation. Second, multi-scale RGB images guided network is utilized to refine the final disparity estimation. Experiments on several datasets show that the proposed algorithm can achieve competitive results with speed of 64fps on a NIVDIA 1080 GPU. • The proposed LMNet can balance time efficiency and matching accuracy. • Lightweight MS2D module can extract more global features with less computation. • Lightweight MS3D module yield roubust results especially for slim objects.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.