Abstract

With the rapid growth of multimodal web data, the task of cross-modal retrieval, i.e., using a text query to search for images or vice versa, has attracted considerable attention from researchers. Existing approaches usually learn a common representation space in which different modalities can be directly compared. However, little work has been done to verify that the learned common representation space contains only the part shared between different modalities. In this paper, we present a coordinated and specific restricted Boltzmann machine (CSRBM) that can distinguish the common part from the modality-specific part of different modalities. The proposed CSRBM consists of two RBMs, each with two hidden layers: the common hidden layer learns the patterns shared across modalities, while the modality-specific hidden layer learns the patterns owned by each individual modality. To verify the splitting effectiveness of the proposed model, we construct a multimodal dataset based on the popular MNIST dataset. Moreover, we evaluate our model on three publicly available real-world datasets on the task of cross-modal retrieval. Extensive experiments demonstrate the effectiveness of our CSRBM.
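To make the described architecture concrete, below is a minimal sketch of the layer split the abstract outlines: two RBMs, one per modality, each connecting its visible units to a common hidden layer and a modality-specific hidden layer, with cross-modal comparison performed only in the common space. All layer sizes, the Bernoulli-style sigmoid units, the single mean-field pass, and the cosine-similarity comparison are illustrative assumptions, not the paper's actual formulation or training procedure.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class ModalityRBM:
    """One RBM of the sketch: a visible layer connected to two hidden layers.

    The 'common' hidden layer is meant to capture patterns shared across
    modalities; the 'specific' hidden layer captures patterns unique to this
    modality. Sizes and unit types here are assumptions for illustration only.
    """

    def __init__(self, n_visible, n_common, n_specific, seed=0):
        rng = np.random.default_rng(seed)
        self.W_common = 0.01 * rng.standard_normal((n_visible, n_common))
        self.W_specific = 0.01 * rng.standard_normal((n_visible, n_specific))
        self.b_common = np.zeros(n_common)
        self.b_specific = np.zeros(n_specific)

    def hidden_activations(self, v):
        """Split an input vector into common and modality-specific codes."""
        h_common = sigmoid(v @ self.W_common + self.b_common)
        h_specific = sigmoid(v @ self.W_specific + self.b_specific)
        return h_common, h_specific


# Two RBMs, one per modality; retrieval compares only the common codes.
image_rbm = ModalityRBM(n_visible=4096, n_common=128, n_specific=64)
text_rbm = ModalityRBM(n_visible=2000, n_common=128, n_specific=64)

v_img = np.random.rand(4096)  # toy image feature vector
v_txt = np.random.rand(2000)  # toy text feature vector (e.g., bag-of-words)

h_img_common, _ = image_rbm.hidden_activations(v_img)
h_txt_common, _ = text_rbm.hidden_activations(v_txt)

# Cosine similarity in the common representation space.
similarity = h_img_common @ h_txt_common / (
    np.linalg.norm(h_img_common) * np.linalg.norm(h_txt_common)
)
print(f"cross-modal similarity (common space): {similarity:.3f}")
```

In this sketch, coordination between the two RBMs (i.e., encouraging their common hidden layers to agree on paired data) and the actual RBM training objective are omitted; only the common/specific split of the hidden representation is shown.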
