Hypergraph clustering based multi-label cross-modal retrieval

Shengtang Guo,Huaxiang Zhang,Li Liu,Dongmei Liu,Xu Lu,Liujian Li

doi:10.1016/j.jvcir.2024.104258

Abstract

Most existing cross-modal retrieval methods face challenges in establishing semantic connections between different modalities due to inherent heterogeneity among them. To establish semantic connections between different modalities and align relevant semantic features across modalities, so as to fully capture important information within the same modality, this paper considers the superiority of hypergraph in representing higher-order relationships, and proposes an image-text retrieval method based on hypergraph clustering. Specifically, we construct hypergraphs to capture feature relationships within image and text modalities, as well as between image and text. This allows us to effectively model complex relationships between features of different modalities and explore the semantic connectivity within and across modalities. To compensate for potential semantic feature loss during the construction of the hypergraph neural network, we design a weight-adaptive coarse and fine-grained feature fusion module for semantic supplementation. Comprehensive experimental results on three common datasets demonstrate the effectiveness of the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hypergraph clustering based multi-label cross-modal retrieval

Abstract

Talk to us

Similar Papers

More From: Journal of Visual Communication and Image Representation

Lead the way for us

Similar Papers

Semantic consistent adversarial cross-modal retrieval exploiting semantic similarity
Weihua Ou ... Ruisheng Xuan
Multimedia Tools and Applications | VOL. 79
Weihua Ou, et. al.Weihua Ou ... Ruisheng Xuan
21 Feb 2019
Multimedia Tools and Applications | VOL. 79

Text modality enhanced based deep hashing for multi-label cross-modal retrieval
Huan Liu ... Jiang Xiong
-
Huan Liu, et. al.Huan Liu ... Jiang Xiong
15 Jul 2022
15 Jul 2022

Cross-modal retrieval based on fusion lightweight network
Ying Liu ... Weidong Zhang
-
Ying Liu, et. al.Ying Liu ... Weidong Zhang
23 Sep 2022
23 Sep 2022

Fine-Grained Correlation Learning with Stacked Co-attention Networks for Cross-Modal Information Retrieval
Yuhang Lu ... Li Guo
-
Yuhang Lu, et. al.Yuhang Lu ... Li Guo
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hypergraph clustering based multi-label cross-modal retrieval

Abstract

Talk to us

Similar Papers

More From: Journal of Visual Communication and Image Representation