Abstract

Cross-modal retrieval, which aims to perform retrieval across different modalities of data, is an active research topic. Because different modalities of data follow inconsistent distributions, reducing the gap between modalities is the core challenge of cross-modal retrieval. Recently, Generative Adversarial Networks (GANs) have been applied to cross-modal retrieval owing to their strong ability to model data distributions. We propose a novel approach named Modality Consistent Generative Adversarial Network (MCGAN) for cross-modal retrieval. The network integrates a generator that synthesizes image features from text features, a discriminator that classifies the modality of features, and a modality-consistent embedding network that projects the generated and real image features into a common space for learning discriminative representations. Experiments on two datasets verify the effectiveness of MCGAN for cross-modal retrieval in comparison with state-of-the-art methods.
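
The abstract describes a three-component architecture. Below is a minimal PyTorch sketch of how such components might be organized; the layer structures, widths, and dimension names (text_dim, img_dim, common_dim) are illustrative assumptions, not the paper's exact configuration.

```python
# Illustrative sketch of the three MCGAN components described above.
# All sizes and architectures are assumptions for demonstration only.
import torch
import torch.nn as nn

class Generator(nn.Module):
    """Maps text features to synthetic image features."""
    def __init__(self, text_dim=300, img_dim=4096, hidden=1024):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(text_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, img_dim),
        )

    def forward(self, text_feat):
        return self.net(text_feat)

class Discriminator(nn.Module):
    """Classifies the modality of a feature: real image vs. generated."""
    def __init__(self, img_dim=4096, hidden=1024):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(img_dim, hidden), nn.LeakyReLU(0.2),
            nn.Linear(hidden, 1),  # logit: real vs. synthetic image feature
        )

    def forward(self, img_feat):
        return self.net(img_feat)

class Embedding(nn.Module):
    """Projects real and generated image features into a common space."""
    def __init__(self, img_dim=4096, common_dim=256):
        super().__init__()
        self.proj = nn.Linear(img_dim, common_dim)

    def forward(self, img_feat):
        # L2-normalize so retrieval can use cosine similarity
        return nn.functional.normalize(self.proj(img_feat), dim=-1)

if __name__ == "__main__":
    text = torch.randn(8, 300)       # a batch of text features
    real_img = torch.randn(8, 4096)  # a batch of real image features
    G, D, E = Generator(), Discriminator(), Embedding()
    fake_img = G(text)
    print(D(fake_img).shape, E(real_img).shape)  # [8, 1] and [8, 256]
```

In a training step of this kind, the generator would be updated to fool the discriminator while the embedding network learns discriminative common-space representations from both real and generated image features.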
