Abstract
With the advent of the big data era, multimedia data is growing rapidly and its modalities are becoming increasingly diverse, so the demand for fast and accurate cross-modal information retrieval keeps rising. Hashing-based cross-modal retrieval has attracted widespread attention because it encodes multimedia data into a common binary hash space, enabling efficient measurement of the correlation between samples from different modalities. In this paper, we propose a novel end-to-end deep cross-modal retrieval framework, namely Clustering-driven Deep Adversarial Hashing (CDAH), which has three main characteristics. First, CDAH learns discriminative clusters recursively through a soft clustering model; it generates modality-invariant representations in a common space by confusing a modality classifier that tries to distinguish the modalities from the generated representations. Second, to minimize the gap between feature representations of different modalities that share a semantic label, and to maximize the distance between images and texts with different labels, CDAH constructs a fused-semantics matrix that integrates the original domain information from the different modalities and serves as self-supervised information to refine the binary codes. Finally, CDAH uses a scaled tanh function to learn the binary codes adaptively, gradually converging to the originally intractable discrete coding problem. We conduct comprehensive experiments on four popular datasets, and the results demonstrate the superiority of our model over state-of-the-art methods.
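The scaled tanh relaxation can be sketched as follows; this is a standard formulation in deep hashing, and the scale parameter \(\beta_t\) and its growth schedule are our assumption rather than details given in the abstract:

\[
\hat{b} = \tanh(\beta_t z), \qquad \lim_{\beta_t \to \infty} \tanh(\beta_t z) = \operatorname{sign}(z) \quad (z \neq 0),
\]

where \(z\) is the real-valued network output and \(\beta_t\) is assumed to increase monotonically over training. Early in training, small \(\beta_t\) keeps the relaxation smooth and gradients informative; as \(\beta_t\) grows, \(\hat{b}\) approaches exact binary codes, which is how the relaxation "gradually converges" to the discrete coding problem.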