Abstract

Image-level structural recognition is an important problem for many applications of computer vision such as autonomous vehicle control, scene understanding, and 3D TV. A novel method, using image features extracted by exploiting predefined templates, each associated with individual classifier, is proposed. The template that reflects the symmetric structure consisting of a number of components represents a stage—a rough structure of an image geometry. The following image features are used: a histogram of oriented gradient (HOG) features showing the overall object shape, colors representing scene information, the parameters of the Weibull distribution features, reflecting relations between image statistics and scene structure, and local binary pattern (LBP) and entropy (E) values representing texture and scene depth information. Each of the individual classifiers learns a discriminative model and their outcomes are fused together using sum rule for recognizing the global structure of an image. The proposed method achieves an 86.25% recognition accuracy on the stage dataset and a 92.58% recognition rate on the 15-scene dataset, both of which are significantly higher than the other state-of-the-art methods.

Highlights

  • A human brain can understand images of natural, urban outdoor and indoor 3D scenes efficiently [1,2]

  • Other approaches to scene recognition are based on Bag of Words (BoW) models [8], e.g., J

  • We demonstrate a of image-level structure recognition thatrecognition utilizes image features thesefeatures featuresand are these extracted basedare on novel method of image-level structure that utilizesand image features predefined templates

Read more

Summary

Introduction

A human brain can understand images of natural, urban outdoor and indoor 3D scenes efficiently [1,2] This is because the world around us behaves regularly, and structural regularities are directly reflected in the 2D image scene [3]. Nedovic et al [4] identified several image-level 3D scene geometries on 2D images called ‘stages’. Lou et al [5] use predefined template-based segmentation to extract features, namely histograms of oriented gradients (HOGs) [6], mean color values (Hue, Saturation and Value (HSV) and red (R), green(G), and blue(B)), and parameters of Weibull distribution features [7] for each image patch and introduced a graphical model to learn mapping from image features to a stage.

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call