Cross-modal discriminant adversarial network

Peng Hu,Xi Peng,Hongyuan Zhu,Jie Lin,Liangli Zhen,Wei Wang,Dezhong Peng

doi:10.1016/j.patcog.2020.107734

Abstract

Cross-modal retrieval aims at retrieving relevant points across different modalities, such as retrieving images via texts. One key challenge of cross-modal retrieval is narrowing the heterogeneous gap across diverse modalities. To overcome this challenge, we propose a novel method termed as Cross-modal discriminant Adversarial Network (CAN). Taking bi-modal data as a showcase, CAN consists of two parallel modality-specific generators, two modality-specific discriminators, and a Cross-modal Discriminant Mechanism (CDM). To be specific, the generators project diverse modalities into a latent cross-modal discriminant space. Meanwhile, the discriminators compete against the generators to alleviate the heterogeneous discrepancy in this space, i.e., the generators try to generate unified features to confuse the discriminators, and the discriminators aim to classify the generated results. To further remove the redundancy and preserve the discrimination, we propose CDM to project the generated results into a single common space, accompanying with a novel eigenvalue-based loss. Thanks to the eigenvalue-based loss, CDM could push as much discriminative power as possible into all latent directions. To demonstrate the effectiveness of our CAN, comprehensive experiments are conducted on four multimedia datasets comparing with 15 state-of-the-art approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Pattern Recognition	Publication Date: Nov 5, 2020
Citations: 16	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

Cross-modal discriminant adversarial network

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Similar Papers

Modeling intra- and inter-pair correlation via heterogeneous high-order preserving for cross-modal retrieval
Leiquan Wang ... Fei Su
Signal Processing | VOL. 131
Leiquan Wang, et. al.Leiquan Wang ... Fei Su
11 Aug 2016
Signal Processing | VOL. 131

Aknowledgement
-
Journal of Economic Behavior and Organization | VOL. 4
--
01 Dec 1983
Journal of Economic Behavior and Organization | VOL. 4

Category supervised cross-modal hashing retrieval for chest X-ray and radiology reports
Yong Zhang ... Jiaxin Deng
Computers & Electrical Engineering | VOL. 98
Yong Zhang, et. al.Yong Zhang ... Jiaxin Deng
29 Jan 2022
Computers & Electrical Engineering | VOL. 98

Deep Adversarial Cascaded Hashing for Cross-Modal Vessel Image Retrieval
Jiaen Guo ... Xin Guan
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 16
Jiaen Guo, et. al.Jiaen Guo ... Xin Guan
01 Jan 2023
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-modal discriminant adversarial network

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition