Considering the Spatial Layout Information of Bag of Features (BoF) Framework for Image Classification.

Guangyu Mu,Limin Wang,Ying Liu

doi:10.1371/journal.pone.0131164

Guangyu Mu, Limin Wang + Show 1 more

Open Access

https://doi.org/10.1371/journal.pone.0131164

Copy DOI

Journal: PloS one	Publication Date: Jun 29, 2015
Citations: 4	License type: CC BY 4.0

Affiliation: Jilin University of Finance and Economics

Abstract

The spatial pooling method such as spatial pyramid matching (SPM) is very crucial in the bag of features model used in image classification. SPM partitions the image into a set of regular grids and assumes that the spatial layout of all visual words obey the uniform distribution over these regular grids. However, in practice, we consider that different visual words should obey different spatial layout distributions. To improve SPM, we develop a novel spatial pooling method, namely spatial distribution pooling (SDP). The proposed SDP method uses an extension model of Gauss mixture model to estimate the spatial layout distributions of the visual vocabulary. For each visual word type, SDP can generate a set of flexible grids rather than the regular grids from the traditional SPM. Furthermore, we can compute the grid weights for visual word tokens according to their spatial coordinates. The experimental results demonstrate that SDP outperforms the traditional spatial pooling methods, and is competitive with the state-of-the-art classification accuracy on several challenging image datasets.

Highlights

Image classification plays a significant role in the computer vision research
Empirical results show that spatial pyramid matching (SPM) can significantly improve the classification performance, it assumes that the spatial layout of all visual words obey the uniform distribution over these regular grids
We develop a novel spatial distribution pooling (SDP) algorithm to improve the spatial pooling in the bag of words (BoW) model for image classification

Summary

Introduction

Image classification plays a significant role in the computer vision research. The recent stateof-the-art image classification pipeline consists of two major parts: 1) the image representation, e.g., bag of features (BoF) [1,2,3] and spatial pyramid matching (SPM) [4]; 2) the classifier, e.g., support vector machines (SVMs) and its variants [5, 6]. Empirical results show that SPM can significantly improve the classification performance, it assumes that the spatial layout of all visual words obey the uniform distribution over these regular grids. SPM rigidly partitions the image into several regular grids, and assumes that the spatial layout of all visual words obey the uniform distribution over these grids. Each visual word in SDP occurs in the regular grids in each level following equal probability This generates a conflict to the intuition that different visual words should obey different spatial layout distributions. Under e-GMM, SDP can assign each visual word to a latent grid according to its spatial coordinate, instead of a regular grid. The inferential problem is to compute the posterior distribution of the grid assignment given a visual word v with spatial coordinate c!v

Related work

Experiments with Parameters

Findings

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Considering the Spatial Layout Information of Bag of Features (BoF) Framework for Image Classification.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Similar Papers

Feedback-based Dynamically Weighted BoF for Image Retrieval
Li Li ... Yingqian Jia
-
Li Li, et. al.Li Li ... Yingqian Jia
01 Jan 2015
01 Jan 2015

Improved Spatial Pyramid Matching for Sports Image Classification
Yue Gao ... Kazuki Katagishi
-
Yue Gao, et. al.Yue Gao ... Kazuki Katagishi
01 Feb 2016
01 Feb 2016

Compact and discriminative representation of Bag-of-Features
Jiangtao Cui ... Guangxin Li
Neurocomputing | VOL. 169
Jiangtao Cui, et. al.Jiangtao Cui ... Guangxin Li
28 May 2015
Neurocomputing | VOL. 169

Semantic-Spatial Matching for image classification
Yupeng Yan ... Yijuan Lu
-
Yupeng Yan, et. al. Yupeng Yan ... Yijuan Lu
01 Jul 2013
01 Jul 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Considering the Spatial Layout Information of Bag of Features (BoF) Framework for Image Classification.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PloS one