Abstract

Recently, numerous unsupervised cross-modal hashing methods have been proposed to deal with image-text retrieval tasks on unlabeled cross-modal data. However, when these methods learn to generate hash codes, almost all of them lack modality interaction in the following two aspects: (1) The instance similarity matrix used to guide the training of the hashing networks is constructed without image-text interaction, and thus fails to capture the fine-grained cross-modal cues needed to accurately characterize the intrinsic semantic similarity among the data points. (2) The binary codes used in the quantization loss are of low quality because they are generated by directly quantizing a simple combination of continuous hash codes from different modalities, without any interaction among these continuous codes. These problems degrade the quality of the generated hash codes and, in turn, the retrieval performance. Hence, in this paper, we propose a novel Unsupervised Cross-modal Hashing method with Modality-interaction, termed UCHM. Specifically, by optimizing a novel hash-similarity-friendly loss, a modality-interaction-enabled (MIE) similarity generator is first trained to produce a superior MIE similarity matrix for the training set. The generated MIE similarity matrix is then used as guiding information to train the deep hashing networks. Furthermore, during the training of the hashing networks, a novel bit-selection module is proposed to generate high-quality unified binary codes for the quantization loss by exploiting the interaction among continuous codes from different modalities, thereby further enhancing the retrieval performance. Extensive experiments on two widely used datasets show that the proposed UCHM outperforms state-of-the-art techniques on cross-modal retrieval tasks.
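The abstract does not reproduce the exact losses, so the following is only a minimal PyTorch-style sketch of the general training idea it describes: continuous image and text hash codes are pushed to agree with a precomputed guidance similarity matrix, and a quantization loss pulls them toward unified binary codes. The tanh-activated codes `h_img`/`h_txt`, the matrix `S` standing in for the MIE similarity matrix, and the magnitude-based stand-in for the proposed bit-selection module are all assumptions for illustration, not the authors' implementation.

```python
# Minimal sketch (not the authors' code) of similarity-guided cross-modal
# hashing with a quantization loss toward unified binary codes.
import torch
import torch.nn.functional as F

def hashing_loss(h_img, h_txt, S, alpha=1.0):
    """h_img, h_txt: (n, k) continuous hash codes in [-1, 1] (e.g. tanh outputs).
    S: (n, n) guidance similarity matrix (stand-in for the MIE similarity matrix)."""
    k = h_img.size(1)
    # Similarity preservation: scaled inner products of the continuous codes
    # should match the guidance similarity matrix S.
    sim_it = h_img @ h_txt.t() / k        # cross-modal (image-text)
    sim_ii = h_img @ h_img.t() / k        # intra-modal (image)
    sim_tt = h_txt @ h_txt.t() / k        # intra-modal (text)
    sim_loss = F.mse_loss(sim_it, S) + F.mse_loss(sim_ii, S) + F.mse_loss(sim_tt, S)

    # Simplified stand-in for bit selection: per bit, keep the continuous value
    # with the larger magnitude across modalities, then binarize to build
    # unified codes B (the paper's actual bit-selection module may differ).
    mask = (h_img.abs() >= h_txt.abs()).float()
    B = torch.sign(mask * h_img + (1.0 - mask) * h_txt).detach()

    # Quantization loss pulls both modalities' continuous codes toward B.
    quant_loss = F.mse_loss(h_img, B) + F.mse_loss(h_txt, B)
    return sim_loss + alpha * quant_loss
```

In such a setup, `S` would be produced once (or periodically) by the similarity generator, while the hashing networks are updated by backpropagating this combined loss; the `detach()` on `B` keeps the binary targets fixed within each step.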
