Image Classification with the Fisher Vector: Theory and Practice

Jorge Sánchez,Thomas Mensink,Florent Perronnin,Jakob Verbeek

doi:10.1007/s11263-013-0636-x

Abstract

A standard approach to describe an image for classification and retrieval purposes is to extract a set of local patch descriptors, encode them into a high dimensional vector and pool them into an image-level signature. The most common patch encoding strategy consists in quantizing the local descriptors into a finite set of prototypical elements. This leads to the popular Bag-of-Visual words representation. In this work, we propose to use the Fisher Kernel framework as an alternative patch encoding strategy: we describe patches by their deviation from an “universal” generative Gaussian mixture model. This representation, which we call Fisher vector has many advantages: it is efficient to compute, it leads to excellent results even with efficient linear classifiers, and it can be compressed with a minimal loss of accuracy using product quantization. We report experimental results on five standard datasets—PASCAL VOC 2007, Caltech 256, SUN 397, ILSVRC 2010 and ImageNet10K—with up to 9M images and 10K classes, showing that the FV framework is a state-of-the-art patch encoding technique.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Computer Vision	Publication Date: Jun 12, 2013
Citations: 1511	License type: cc-by-nc-sa

R Discovery Prime

R Discovery Prime

Image Classification with the Fisher Vector: Theory and Practice

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision

Lead the way for us

Similar Papers

Deep FisherNet for Image Classification.
Peng Tang ... Wenyu Liu
IEEE Transactions on Neural Networks and Learning Systems | VOL. 30
Peng Tang, et. al.Peng Tang ... Wenyu Liu
05 Nov 2018
IEEE Transactions on Neural Networks and Learning Systems | VOL. 30

Probability Loop Closure Detection with Fisher Kernel Framework for Visual SLAM
Ge Zhang ... Qian Zuo
-
Ge Zhang, et. al.Ge Zhang ... Qian Zuo
01 Jan 2021
01 Jan 2021

Compact Representation of High-Dimensional Feature Vectors for Large-Scale Image Recognition and Retrieval.
Yu Zhang ... Jianxin Wu
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 25
Yu Zhang, et. al.Yu Zhang ... Jianxin Wu
01 May 2016
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society | VOL. 25

Predicting a Cold from Speech Using Fisher Vectors; SVM and XGBoost as Classifiers
José Vicente Egas-López ... Gábor Gosztolya
-
José Vicente Egas-López, et. al.José Vicente Egas-López ... Gábor Gosztolya
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Image Classification with the Fisher Vector: Theory and Practice

Abstract

Talk to us

Similar Papers

More From: International Journal of Computer Vision