Abstract
Estimating scene depth from a single monocular image is a crucial component in computer vision tasks, enabling many further applications such as robot vision, 3-D modeling, and above all, 2-D to 3-D image/video conversion. Since there are an infinite number of possible world scenes, that can produce a unique image, single image depth estimation is a highly challenging task. This paper tackles such an ambiguous problem by using the merits of both global and local information (structures) of a scene. To this end, we formulate single image depth estimation as a regression problem via (on) rich depth related features which describe effective monocular cues. Exploiting the relationship between these image features and depth values is adopted via a learning model which is inspired by modified stacked generalization scheme. The experiments demonstrate competitive results compared with existing data-driven approaches in both quantitative and qualitative analysis with a remarkably simpler approach than previous works.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: IEEE Transactions on Circuits and Systems for Video Technology
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.