Abstract

In this work, we propose a new approach for dense disparity estimation in a global energy minimization framework. We propose to use a feature matching cost which is defined using the learned hierarchical features of given left and right stereo images and we combine it with the pixel-based intensity matching cost in our energy function. Hierarchical features are learned using the deep deconvolutional network which is trained in an unsupervised way using a database consisting of large number of stereo images. In order to perform the regularization, we propose to use the inhomogeneous Gaussian Markov random field (IGMRF) and sparsity priors in our energy function. A sparse autoencoder-based approach is proposed for learning and inferring the sparse representation of disparities. The IGMRF prior captures the smoothness as well as preserves sharp discontinuities while the sparsity prior captures the sparseness in the disparity map. Finally, an iterative two-phase algorithm is proposed to estimate the dense disparity map where in phase one, sparse representation of disparities are inferred from the trained sparse autoencoder, and IGMRF parameters are computed, keeping the disparity map fixed and in phase two, the disparity map is refined by minimizing the energy function using graph cuts, with other parameters fixed. Experimental results on the Middlebury stereo benchmarks demonstrate the effectiveness of the proposed approach.

Highlights

  • Stereo vision has been an active research area in the field of computer vision for more than three decades

  • A two-layer deep deconvolutional network was trained over ms=75 left stereo images obtained from the Middlebury 2005 and 2006 datasets and Middlebury 2014 training dataset [2]

  • 8 Conclusion We have presented a new approach for dense disparity map estimation based on inhomogeneous Markov random field (MRF) and sparsity priors in an energy minimization framework

Read more

Summary

Introduction

Stereo vision has been an active research area in the field of computer vision for more than three decades. Use of adaptive windows [3], multiple windows [4], adaptive weights [5], Nahar and Joshi IPSJ Transactions on Computer Vision and Applications (2017) 9:2 or bilateral filtering [6] in local methods reduce these effects but cannot avoid it completely Global approaches tackle such problems by incorporating regularization such as explicit smoothness assumption and estimate the dense disparity map by minimizing an energy function. In [27], authors proposed unsupervised feature learning for dense stereo matching within a energy minimization framework They learn the features from a large amount of image patches using K-singular value decomposition (K-SVD) dictionary learning approach. In [46], authors proposed a two-layer graphical model for inferring the disparity map by including a sparsity prior over learned sparse representation of disparities in an existing MRF-based stereo matching framework. The experimental results and the performance of the proposed approach are dealt in the “Experimental results” section, and concluding remarks are drawn in the “Conclusion” section

Problem formulation
Deep deconvolutional network for extracting hierarchical features
Feature encoding
Dense disparity estimation
Performance evaluation using different data terms
Method
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call