Have a SNAK. Encoding Spatial Information with the Spatial Non-alignment Kernel

Radu Tudor Ionescu,Marius Popescu

doi:10.1007/978-3-319-23231-7_9

Abstract

AbstractThe standard bag of visual words model model ignores the spatial information contained in the image, but researchers have demonstrated that the object recognition performance can be improved by including spatial information. A state of the art approach is the spatial pyramid representation, which divides the image into spatial bins. In this paper, another general approach that encodes the spatial information in a much better and efficient way is described. The proposed approach is to embed the spatial information into a kernel function termed the Spatial Non-Alignment Kernel (SNAK). For each visual word, the average position and the standard deviation is computed based on all the occurrences of the visual word in the image. These are computed with respect to the center of the object, which is determined with the help of the objectness measure. The pairwise similarity of two images is then computed by taking into account the difference between the average positions and the difference between the standard deviations of each visual word in the two images. In other words, the SNAK kernel includes the spatial distribution of the visual words in the similarity of two images. Furthermore, various kernel functions can be plugged into the SNAK framework. Object recognition experiments are conducted to compare the SNAK framework with the spatial pyramid representation, and to assess the performance improvements for various state of the art kernels on two benchmark data sets. The empirical results indicate that SNAK significantly improves the object recognition performance of every evaluated kernel. Compared to the spatial pyramid, SNAK improves performance while consuming less space and time. In conclusion, SNAK can be considered a good candidate to replace the widely-used spatial pyramid representation.KeywordsKernel methodSpatial informationBag of visual words

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Have a SNAK. Encoding Spatial Information with the Spatial Non-alignment Kernel

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Object Recognition with the Bag of Visual Words Model
Radu Tudor Ionescu ... Marius Popescu
-
Radu Tudor Ionescu, et. al.Radu Tudor Ionescu ... Marius Popescu
01 Jan 2015
01 Jan 2015

Spatial Information in Text Categorization
Radu Tudor Ionescu ... Marius Popescu
-
Radu Tudor Ionescu, et. al.Radu Tudor Ionescu ... Marius Popescu
01 Jan 2015
01 Jan 2015

Response to Letter to the Editor
Michael S Chen ... Deepak L Bhatt
American Heart Journal | VOL. 152
Michael S Chen, et. al.Michael S Chen ... Deepak L Bhatt
25 Oct 2006
American Heart Journal | VOL. 152

PQ kernel: A rank correlation kernel for visual word histograms
Radu Tudor Ionescu ... Marius Popescu
Pattern Recognition Letters | VOL. 55
Radu Tudor Ionescu, et. al.Radu Tudor Ionescu ... Marius Popescu
19 Jun 2014
Pattern Recognition Letters | VOL. 55

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Have a SNAK. Encoding Spatial Information with the Spatial Non-alignment Kernel

Abstract

Talk to us

Similar Papers