SENSC algorithm for object and scene categorization

Bao-Di Liu,Yu-Jin Zhang,Cees G M Snoek

doi:10.1117/12.872220

Abstract

So far, the most popular method for object & scene categorization (such as Vector Quantization (VQ), Sparse coding (SC)) transforms low-level descriptors (usually SIFT descriptors) into mid-level representations with more meaningful information. These methods have two key steps: (1) the building dictionary step, which provides a mechanism to map low-level descriptors into mid-level representations. (2) The coding step, which implements the map from low-level descriptors to mid-level representation for each image by the dictionary. In this paper, we proposed to use a stable and efficient nonnegative sparse coding (SENSC) algorithm for building dictionary and coding each image with it to develop an extension of Spatial Pyramid Matching (SPM) method. We also compare SENSC with SC (state-of-the-art performance) method and VQ method, analysis the drawbacks of SC and VQ for building dictionary, and show SENSC algorithm's performance. According to the experiments on three benchmarks (Caltech101, scene, and events), the method we proposed has shown a better performance than SC and VQ methods.

Full Text