Web Image Research Articles

Unsupervised object discovery and localization is to discover some dominant object classes and localize all of object instances from a given image collection without any supervision. Previous work has attempted to tackle this problem with vanilla topic models, such as latent Dirichlet allocation (LDA). However, in those methods no prior knowledge for the given image collection is exploited to facilitate object discovery. On the other hand, the topic models used in those methods suffer from the topic coherence issue-some inferred topics do not have clear meaning, which limits the final performance of object discovery. In this paper, prior knowledge in terms of the so-called must-links are exploited from Web images on the Internet. Furthermore, a novel knowledge-based topic model, called LDA with mixture of Dirichlet trees, is proposed to incorporate the must-links into topic modeling for object discovery. In particular, to better deal with the polysemy phenomenon of visual words, the must-link is re-defined as that one must-link only constrains one or some topic(s) instead of all topics, which leads to significantly improved topic coherence. Moreover, the must-links are built and grouped with respect to specific object classes, thus the must-links in our approach are semantic-specific, which allows to more efficiently exploit discriminative prior knowledge from Web images. Extensive experiments validated the efficiency of our proposed approach on several data sets. It is shown that our method significantly improves topic coherence and outperforms the unsupervised methods for object discovery and localization. In addition, compared with discriminative methods, the naturally existing object classes in the given image collection can be subtly discovered, which makes our approach well suited for realistic applications of unsupervised object discovery.Unsupervised object discovery and localization is to discover some dominant object classes and localize all of object instances from a given image collection without any supervision. Previous work has attempted to tackle this problem with vanilla topic models, such as latent Dirichlet allocation (LDA). However, in those methods no prior knowledge for the given image collection is exploited to facilitate object discovery. On the other hand, the topic models used in those methods suffer from the topic coherence issue-some inferred topics do not have clear meaning, which limits the final performance of object discovery. In this paper, prior knowledge in terms of the so-called must-links are exploited from Web images on the Internet. Furthermore, a novel knowledge-based topic model, called LDA with mixture of Dirichlet trees, is proposed to incorporate the must-links into topic modeling for object discovery. In particular, to better deal with the polysemy phenomenon of visual words, the must-link is re-defined as that one must-link only constrains one or some topic(s) instead of all topics, which leads to significantly improved topic coherence. Moreover, the must-links are built and grouped with respect to specific object classes, thus the must-links in our approach are semantic-specific, which allows to more efficiently exploit discriminative prior knowledge from Web images. Extensive experiments validated the efficiency of our proposed approach on several data sets. It is shown that our method significantly improves topic coherence and outperforms the unsupervised methods for object discovery and localization. In addition, compared with discriminative methods, the naturally existing object classes in the given image collection can be subtly discovered, which makes our approach well suited for realistic applications of unsupervised object discovery.

Read full abstract

With the rapid development of information technology, the capacity of Web images database becomes larger and larger. How to quickly and effectively find the desired images in Web image databases becomes the challenge needed to resolve with high priority. In this paper a novel diagonal texture structure descriptor (DTSD) is proposed, and a new framework considering hue, saturation and value components is utilized for image retrieval. In specific, we firstly use Otsu algorithm to segment image into foreground and background, and the features of multi-regions are respectively considered. That is, we present the contents of these multi-regions distinctively to reduce the influence of each other, which would perform hierarchical feature description and realize more accurate content match for image retrieval. In this study, to simulate the characteristic of human eyes for perceiving colors, hue and saturation components are quantized into various bins which can obtain more detailed description for color difference. Meanwhile, DTSD is extracted based on value component to represent the edge information as the feature of receptive field. Such a method can improve the spatial resolution ability of the descriptor, and identify finer structure of an image. Moreover, histogram with respect to these three components, i.e., hue, saturation and value, is utilized to generate the feature vector of an image. We carry out the experiments on benchmark Corel and UCID image datasets, and the extensive experimental results demonstrate that our method achieves better performance in comparison with state of the art image retrieval algorithms. The proposed method is very promising, which can provide more accurate retrieved results on the basis of color & texture descriptions in multi-regions, and further enhances the performance of the intelligent image retrieval system.

Read full abstract

Web Image Research Articles

Related Topics

Articles published on Web Image

Affective image classification via semi-supervised learning from web images

What can we learn about the female condom online? An analysis of visual representations of the female condom on the Internet

Deepdiary: Lifelogging image captioning and summarization

Large-scale k-means clustering via variance reduction

Extracting Key Segments of Videos for Event Detection by Learning From Web Sources

STUDY OF SOCIAL IMAGE RE-RANKING ACCORDING INTER AND INTRA USER IMPACT

Discovering and Distinguishing Multiple Visual Senses for Polysemous Words

Web image re-ranking using query specific in cloud computing

Creating personalized video summaries via semantic event detection

Hypergraph dominant set based multi-video summarization

A recursive framework for expression recognition: from web images to deep models to game dataset

CISRDCNN: Super-resolution of compressed images using deep convolutional neural networks

Knowledge-Based Topic Model for Unsupervised Object Discovery and Localization.

Decision Support for Grape Crop Protection Using Ontology

A Benchmark Dataset and Learning High-Level Semantic Embeddings of Multimedia for Cross-Media Retrieval

Image-Matching Based Identification of Store Signage Using Web-Crawled Information

Clustering of near duplicate images using bundled features

Top-Down Neural Attention by Excitation Backprop

Accessible images (AIMS): a model to build self-describing images for assisting screen reader users

Taking advantage of multi-regions-based diagonal texture structure descriptor for image retrieval

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Web Image Research Articles

Related Topics

Articles published on Web Image

Affective image classification via semi-supervised learning from web images

What can we learn about the female condom online? An analysis of visual representations of the female condom on the Internet

Deepdiary: Lifelogging image captioning and summarization

Large-scale k-means clustering via variance reduction

Extracting Key Segments of Videos for Event Detection by Learning From Web Sources

STUDY OF SOCIAL IMAGE RE-RANKING ACCORDING INTER AND INTRA USER IMPACT

Discovering and Distinguishing Multiple Visual Senses for Polysemous Words

Web image re-ranking using query specific in cloud computing

Creating personalized video summaries via semantic event detection

Hypergraph dominant set based multi-video summarization

A recursive framework for expression recognition: from web images to deep models to game dataset

CISRDCNN: Super-resolution of compressed images using deep convolutional neural networks

Knowledge-Based Topic Model for Unsupervised Object Discovery and Localization.

Decision Support for Grape Crop Protection Using Ontology

A Benchmark Dataset and Learning High-Level Semantic Embeddings of Multimedia for Cross-Media Retrieval

Image-Matching Based Identification of Store Signage Using Web-Crawled Information

Clustering of near duplicate images using bundled features

Top-Down Neural Attention by Excitation Backprop

Accessible images (AIMS): a model to build self-describing images for assisting screen reader users

Taking advantage of multi-regions-based diagonal texture structure descriptor for image retrieval