Abstract

Automatic image annotation plays a significant role in image understanding, retrieval, classification, and indexing. Today, it is becoming increasingly important in order to annotate large-scale social media images from content-sharing websites and social networks. These social images are usually annotated by user-provided low-quality tags. The topic model is considered as a promising method to describe these weak-labeling images by learning latent representations of training samples. The recent annotation methods based on topic models have two shortcomings. First, they are difficult to scale to a large-scale image dataset. Second, they can not be used to online image repository because of continuous addition of new images and new tags. In this paper, we propose a novel annotation method based on topic model, namely local learning-based probabilistic latent semantic analysis (LL-PLSA), to solve the above problems. The key idea is to train a weighted topic model for a given test image on its semantic neighborhood consisting of a fixed number of semantically and visually similar images. This method can scale to a large-scale image database, as training samples involved in modeling are a few nearest neighbors rather than the entire database. Moreover, this proposed topic model, online customized for the test image, naturally addresses the issue of continuous addition of new images and new tags in a database. Extensive experiments on three benchmark datasets demonstrate that the proposed method significantly outperforms the state-of-the-art especially in terms of overall metrics.

Highlights

  • Automatic image annotation (AIA) is an important and challenging task in the field of computer vision

  • To overcome the above shortcomings, in this paper, we propose a novel annotation approach based on topic model, namely local learning-based probabilistic latent semantic analysis (PLSA) (LL-PLSA), which aims to improve the semantic level, and reduce complexity of model training

  • WORK We present a novel image annotation based on PLSA model

Read more

Summary

INTRODUCTION

Automatic image annotation (AIA) is an important and challenging task in the field of computer vision. In terms of average precision, the annotation performance of the proposed approach is comparable to MBRM [14] This improved topic model can reduce storage cost, but these SVM classifiers require to be retrained as new labels are added to the database. C. DEEP LEARNING BASED APPROACHES Recently, Convolutional Neural Networks (CNNs) have shown great performance in many computer vision tasks by extracting effective feature vectors from images [21]–[25]. The class label can be considered as the global descriptor of the image, whereas the annotation words can be considered as the local descriptors of the objects in the image To benefit from both semantic and visual features, we extract multi-level features from different layers simultaneously for AIA.

WEIGHTED PLSA MODEL FOR IMAGE ANNOTATION
12: Select top 5 words with the highest distributions of
IMPLEMENTATION DETAILS
RESULTS AND COMPARISON
Findings
CONCLUSION AND FUTURE WORK

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.