Abstract

So far, the most popular method for object & scene categorization (such as Vector Quantization (VQ), Sparse coding (SC)) transforms low-level descriptors (usually SIFT descriptors) into mid-level representations with more meaningful information. These methods have two key steps: (1) the building dictionary step, which provides a mechanism to map low-level descriptors into mid-level representations. (2) The coding step, which implements the map from low-level descriptors to mid-level representation for each image by the dictionary. In this paper, we proposed to use a stable and efficient nonnegative sparse coding (SENSC) algorithm for building dictionary and coding each image with it to develop an extension of Spatial Pyramid Matching (SPM) method. We also compare SENSC with SC (state-of-the-art performance) method and VQ method, analysis the drawbacks of SC and VQ for building dictionary, and show SENSC algorithm's performance. According to the experiments on three benchmarks (Caltech101, scene, and events), the method we proposed has shown a better performance than SC and VQ methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.