Using Multiple Segmentations to Discover Objects and their Extent in Image Collections

B.C. Russell, W.T. Freeman, J. Sivic, A.A. Efros, A. Zisserman

doi:10.1109/cvpr.2006.326

Abstract

Given a large dataset of images, we seek to automatically determine the visually similar object and scene classes together with their image segmentation. To achieve this we combine two ideas: (i) that a set of segmented objects can be partitioned into visual object classes using topic discovery models from statistical text analysis; and (ii) that visual object classes can be used to assess the accuracy of a segmentation. To tie these ideas together we compute multiple segmentations of each image and then: (i) learn the object classes; and (ii) choose the correct segmentations. We demonstrate that such an algorithm succeeds in automatically discovering many familiar objects in a variety of image datasets, including those from Caltech, MSRC and LabelMe.
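One way to picture the "choose the correct segmentations" step is to score every candidate segment against a discovered class and keep the closest match. The sketch below is an illustration under stated assumptions, not the paper's procedure: the abstract does not give a scoring rule, so the KL-divergence score, the smoothing constant `eps`, the toy histograms, and the names `kl_divergence` and `best_segment` are all choices made for this example.

```python
import math

def kl_divergence(hist, topic, eps=1e-9):
    """KL(p || topic), where p is the normalized visual-word histogram.

    Assumption: `topic` is a word distribution for one discovered class,
    already learned by some topic model (not shown here).
    """
    total = sum(hist)
    if total == 0:
        return math.inf  # an empty segment cannot match any class
    kl = 0.0
    for count, q in zip(hist, topic):
        p = count / total
        if p > 0:
            kl += p * math.log(p / max(q, eps))
    return kl

def best_segment(segments, topic):
    """Return (name, score) of the segment with the lowest divergence."""
    score, name = min((kl_divergence(h, topic), n) for n, h in segments.items())
    return name, score

# Hypothetical candidate segments pooled from several segmentations of one image.
candidates = {
    "seg_a": [8, 1, 1, 0],
    "seg_b": [3, 3, 2, 2],
    "seg_c": [0, 0, 5, 5],
}
topic = [0.7, 0.1, 0.1, 0.1]  # hypothetical class word distribution
name, score = best_segment(candidates, topic)
```

Here `seg_a` is selected because its word proportions are closest to the hypothetical class distribution; other divergence or likelihood scores could be substituted without changing the overall structure.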

Highlights

In [21] we posed the question, given a (Gargantuan) number of images, “Is it possible to learn visual object classes from looking at images?”
Images are treated as documents, with each image being represented by a histogram of visual words
In this paper we propose to use image segmentation as a way to utilize visual grouping cues to produce groups of related visual words
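The representation described in these highlights can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each local feature has already been quantized to an integer visual-word id and assigned to a segment by some segmentation algorithm. The function name `segment_histograms` and the toy data are hypothetical.

```python
from collections import Counter, defaultdict

def segment_histograms(word_ids, segment_ids, vocab_size):
    """Build one visual-word histogram per segment.

    word_ids[i]    : visual-word index of feature i (assumed pre-computed)
    segment_ids[i] : id of the segment that feature i falls in
    """
    if len(word_ids) != len(segment_ids):
        raise ValueError("word_ids and segment_ids must have the same length")
    counts = defaultdict(Counter)
    for word, seg in zip(word_ids, segment_ids):
        if not 0 <= word < vocab_size:
            raise ValueError(f"visual word {word} is outside the vocabulary")
        counts[seg][word] += 1
    # Expand each sparse counter into a dense histogram over the vocabulary.
    return {seg: [c[w] for w in range(vocab_size)] for seg, c in counts.items()}

# Toy image: seven features, four visual words, two segments.
words = [0, 2, 2, 1, 3, 3, 3]
segments = [0, 0, 0, 1, 1, 1, 1]
hists = segment_histograms(words, segments, vocab_size=4)

# Summing over segments recovers the whole-image "document" histogram.
whole_image = [sum(col) for col in zip(*hists.values())]
```

Treating each segment as its own small document, rather than the whole image, is what lets grouping cues restrict which visual words are counted together.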

Summary

Introduction

In [21] we posed the question, given a (Gargantuan) number of images, “Is it possible to learn visual object classes from looking at images?” Some success has been reported in discovering object and scene categories [7, 17, 21] by borrowing tools from the statistical text analysis community. These tools, such as probabilistic Latent Semantic Analysis (pLSA) [12] and Latent Dirichlet Allocation (LDA) [2], use an unordered “bag of words” representation of documents to automatically discover topics in a large text corpus. To map these techniques onto the visual domain, an equivalent notion of a text word needs to be defined. Applying topic discovery to such a representation is successful in classifying the image, but the resulting object segmentations are “soft” – the discovered objects (or scenes) are shown by highlighting the visual



Publication Date: Jun 17, 2006
Citations: 579
License type: cc-by

