Fine-grained bidirectional attentional generation and knowledge-assisted networks for cross-modal retrieval

Jianwei Zhu,Huifang Ma,Jiahui Wei,Zhixin Li,Yufei Zeng

doi:10.1016/j.imavis.2022.104507

Abstract

Generally, most existing cross-modal retrieval methods only consider global or local semantic embeddings, lacking fine-grained dependencies between objects. At the same time, it is usually ignored that the mutual transformation between modalities also facilitates the embedding of modalities. Given these problems, we propose a method called BiKA (Bidirectional Knowledge-assisted embedding and Attention-based generation). The model uses a bidirectional graph convolutional neural network to establish dependencies between objects. In addition, it employs a bidirectional attention-based generative network to achieve the mutual transformation between modalities. Specifically, the knowledge graph is used for local matching to constrain the local expression of the modalities, in which the generative network is used for mutual transformation to constrain the global expression of the modalities. In addition, we also propose a new position relation embedding network to embed position relation information between objects. The experiments on two public datasets show that the performance of our method has been dramatically improved compared to many state-of-the-art models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fine-grained bidirectional attentional generation and knowledge-assisted networks for cross-modal retrieval

Abstract

Talk to us

Similar Papers

More From: Image and Vision Computing

Lead the way for us

Journal: Image and Vision Computing	Publication Date: Aug 1, 2022
Citations: 4

Similar Papers

Image-Text Matching with Fine-Grained Relational Dependency and Bidirectional Attention-Based Generative Networks
Jianwei Zhu ... Zhixin Li
-
Jianwei Zhu, et. al.Jianwei Zhu ... Zhixin Li
10 Oct 2022
10 Oct 2022

Improved Collaborative Recommendation Model: Integrating Knowledge Embedding and Graph Contrastive Learning
Liwei Jiang ... Guanghui Yan
Electronics | VOL. 12
Liwei Jiang, et. al.Liwei Jiang ... Guanghui Yan
13 Oct 2023
Electronics | VOL. 12

An Efficient Recommendation Algorithm Integrating Knowledge Graph with Graph Convolutional Networks
Changzheng Xing ... Jialong Guo
-
Changzheng Xing, et. al.Changzheng Xing ... Jialong Guo
01 Feb 2023
01 Feb 2023

Aspect-level sentiment analysis merged with knowledge graph and graph convolutional neural network
Zuhua Dai ... Shilong Di
Journal of Physics: Conference Series | VOL. 2083
Zuhua Dai, et. al.Zuhua Dai ... Shilong Di
01 Nov 2021
Journal of Physics: Conference Series | VOL. 2083

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fine-grained bidirectional attentional generation and knowledge-assisted networks for cross-modal retrieval

Abstract

Talk to us

Similar Papers

More From: Image and Vision Computing