Topic Modeling of Multimodal Data: An Autoregressive Approach

Yin Zheng,Hugo Larochelle,Yu-Jin Zhang

doi:10.1109/cvpr.2014.178

Abstract

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply and extend this model to multimodal data, such as simultaneous image classification and annotation. Specifically, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the hidden topic features by incorporating label information into the training objective of the model and show how to employ SupDocNADE to learn a joint representation from image visual words, annotation words and class label information. We also describe how to leverage information about the spatial position of the visual words for SupDocNADE to achieve better performance in a simple, yet effective manner. We test our model on the LabelMe and UIUC-Sports datasets and show that it compares favorably to other topic models such as the supervised variant of LDA and a Spatial Matching Pyramid (SPM) approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Topic Modeling of Multimodal Data: An Autoregressive Approach

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data.
Yin Zheng ... Yu-Jin Zhang
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 38
Yin Zheng, et. al.Yin Zheng ... Yu-Jin Zhang
11 Sep 2015
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 38

A probabilistic topic model using deep visual word representation for simultaneous image classification and annotation
Seyed Navid Mohammadi Foumani ... Ahmad Nickabadi
Journal of Visual Communication and Image Representation | VOL. 59
Seyed Navid Mohammadi Foumani, et. al.Seyed Navid Mohammadi Foumani ... Ahmad Nickabadi
08 Jan 2019
Journal of Visual Communication and Image Representation | VOL. 59

Latent topic model for image annotation by modeling topic correlation
Xing Xu ... Atsushi Shimada
-
Xing Xu, et. al.Xing Xu ... Atsushi Shimada
01 Jul 2013
01 Jul 2013

Correspondence with category Latent Dirichlet Allocation for image annotation
Xiaoxu Li ... Chunxiao Wu
-
Xiaoxu Li, et. al. Xiaoxu Li ... Chunxiao Wu
01 Jul 2011
01 Jul 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Topic Modeling of Multimodal Data: An Autoregressive Approach

Abstract

Talk to us

Similar Papers