A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data.

Yin Zheng,Hugo Larochelle,Yu-Jin Zhang

doi:10.1109/tpami.2015.2476802

Abstract

Topic modeling based on latent Dirichlet allocation (LDA) has been a framework of choice to deal with multimodal data, such as in image annotation tasks. Another popular approach to model the multimodal data is through deep neural networks, such as the deep Boltzmann machine (DBM). Recently, a new type of topic model called the Document Neural Autoregressive Distribution Estimator (DocNADE) was proposed and demonstrated state-of-the-art performance for text document modeling. In this work, we show how to successfully apply and extend this model to multimodal data, such as simultaneous image classification and annotation. First, we propose SupDocNADE, a supervised extension of DocNADE, that increases the discriminative power of the learned hidden topic features and show how to employ it to learn a joint representation from image visual words, annotation words and class label information. We test our model on the LabelMe and UIUC-Sports data sets and show that it compares favorably to other topic models. Second, we propose a deep extension of our model and provide an efficient way of training the deep model. Experimental results show that our deep model outperforms its shallow version and reaches state-of-the-art performance on the Multimedia Information Retrieval (MIR) Flickr data set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence

Lead the way for us

Journal: IEEE Transactions on Pattern Analysis and Machine Intelligence	Publication Date: Sep 11, 2015
Citations: 96

Similar Papers

Topic Modeling of Multimodal Data: An Autoregressive Approach
Yin Zheng ... Yu-Jin Zhang
-
Yin Zheng, et. al.Yin Zheng ... Yu-Jin Zhang
01 Jun 2014
01 Jun 2014

Latent topic model for image annotation by modeling topic correlation
Xing Xu ... Atsushi Shimada
-
Xing Xu, et. al.Xing Xu ... Atsushi Shimada
01 Jul 2013
01 Jul 2013

A probabilistic topic model using deep visual word representation for simultaneous image classification and annotation
Seyed Navid Mohammadi Foumani ... Ahmad Nickabadi
Journal of Visual Communication and Image Representation | VOL. 59
Seyed Navid Mohammadi Foumani, et. al.Seyed Navid Mohammadi Foumani ... Ahmad Nickabadi
08 Jan 2019
Journal of Visual Communication and Image Representation | VOL. 59

Correspondence with category Latent Dirichlet Allocation for image annotation
Xiaoxu Li ... Chunxiao Wu
-
Xiaoxu Li, et. al. Xiaoxu Li ... Chunxiao Wu
01 Jul 2011
01 Jul 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Deep and Autoregressive Approach for Topic Modeling of Multimodal Data.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Pattern Analysis and Machine Intelligence