Abstract
Semantic understanding of images remains an important research challenge for the image and video retrieval community. A novel natural scene retrieval method based on non-negative sparse coding is proposed in this paper. It firstly combines non-negative sparse coding with spatial pyramid matching for feature extraction and representation. Then, based on sparse coding, it ranks the Euclidean distances from the query image to each of the K-nearest neighbors in database. With the help of SIFT flow and label transfer, we finally realize the segmentation and recognition for the query images. The experimental results show that the proposed method has higher relevant relationship between the query image and each of the K-nearest neighbors in database than the scene retrieval method based on GIST. And the good performances of our method will be greatly helpful for the following image understanding.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have