Abstract
Cross-modal retrieval aims to narrow the heterogeneity gap between different modalities, for example retrieving images with text queries or vice versa. One of the key challenges in cross-modal retrieval is the inconsistent distribution across modalities. Most existing methods construct a common representation subspace to overcome this challenge; however, most single-path cross-modal learning approaches do not fully exploit the supervision information. In this paper, we present a novel Parallel Learned generative adversarial network with Multi-path Subspaces (PLMS) for cross-modal retrieval. PLMS is a parallel-learning architecture that aims to capture more effective information in an end-to-end trained cross-modal retrieval model. Specifically, a dual-branch network is constructed in each modality-specific generator, so that the overall framework learns two common subspaces that emphasize different aspects of the supervision information and preserve more effective transformed features. We further design two objective functions for training the dual branches of the generators. Through joint training, the feature representations generated by the dual branches of a specific modality are fused for similarity measurement between modalities. To avoid redundancy and overlap during fusion, a Multi-source Domain Balancing (MDB) mechanism is presented to explore the contribution of the two task-specific branches. Extensive experiments show that our proposed method is effective and achieves state-of-the-art results on four widely used databases.
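To make the described data flow concrete, below is a minimal, hypothetical PyTorch sketch of the idea in the abstract: a modality-specific generator with two parallel branches whose outputs are fused with learnable balancing weights before cross-modal similarity is computed. The names (DualBranchGenerator, balance, common_dim) and the softmax-weighted fusion are illustrative assumptions standing in for the MDB mechanism, not the authors' actual implementation.

```python
import torch
import torch.nn as nn


class DualBranchGenerator(nn.Module):
    """Illustrative modality-specific generator with two parallel branches.

    Each branch projects the modality feature into its own common subspace;
    the outputs are fused with learnable balancing weights (a hypothetical
    stand-in for the MDB mechanism described in the abstract).
    """

    def __init__(self, in_dim: int, common_dim: int):
        super().__init__()
        # Two parallel branches, each mapping the input into a common subspace.
        self.branch_a = nn.Sequential(nn.Linear(in_dim, common_dim), nn.ReLU(),
                                      nn.Linear(common_dim, common_dim))
        self.branch_b = nn.Sequential(nn.Linear(in_dim, common_dim), nn.ReLU(),
                                      nn.Linear(common_dim, common_dim))
        # Learnable fusion weights, normalised with softmax so the two
        # branch contributions stay balanced during fusion.
        self.balance = nn.Parameter(torch.zeros(2))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        za, zb = self.branch_a(x), self.branch_b(x)
        w = torch.softmax(self.balance, dim=0)
        return w[0] * za + w[1] * zb


# Toy usage: image and text generators mapping into the same common space,
# compared with cosine similarity for retrieval (dimensions are illustrative).
img_gen = DualBranchGenerator(in_dim=4096, common_dim=256)
txt_gen = DualBranchGenerator(in_dim=300, common_dim=256)
img_feat, txt_feat = torch.randn(8, 4096), torch.randn(8, 300)
sim = nn.functional.cosine_similarity(img_gen(img_feat), txt_gen(txt_feat))
print(sim.shape)  # torch.Size([8])
```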