Zero-Shot Cross-Media Embedding Learning With Dual Adversarial Distribution Network

Jingze Chi, Yuxin Peng

doi:10.1109/tcsvt.2019.2900171

Abstract

Existing cross-media retrieval methods mainly assume that the training set covers all the categories in the testing set, and so lack the extensibility to retrieve data of new categories. Zero-shot cross-media retrieval is therefore a promising direction for practical applications: it aims to retrieve data of new categories (unseen categories) using only data of a limited set of known categories (seen categories) for training. The task is challenging because both the heterogeneous distributions across different media types and the inconsistent semantics across seen and unseen categories must be handled. To address these issues, we propose a dual adversarial distribution network (DADN) to learn common embeddings and exploit the knowledge in the word embeddings of different categories. The main contributions are as follows. First, a zero-shot cross-media dual generative adversarial network architecture is proposed, in which two kinds of generative adversarial networks (GANs), one for common embedding generation and one for representation reconstruction, form dual processes. The dual GANs reinforce each other in modeling semantic and underlying structure information, which generalizes across different categories on heterogeneous distributions and boosts correlation learning. Second, distribution matching with the maximum mean discrepancy criterion is combined with the dual GANs, which enhances distribution matching between common embeddings and category word embeddings. Finally, an adversarial inter-media metric constraint is proposed with an inter-media loss and a quadruplet loss, which further models the inter-media correlation information and improves semantic ranking ability. Experiments on four widely used cross-media datasets demonstrate the effectiveness of our DADN approach.
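
The abstract names maximum mean discrepancy (MMD) as the distribution-matching criterion but does not give its form. As a rough illustration only, the sketch below computes a standard biased estimate of squared MMD between two sample sets. The Gaussian kernel, the bandwidth `gamma`, the toy data, and all function and variable names are assumptions made for this example and are not taken from the paper.

```python
import math
import random

def rbf_kernel(x, y, gamma=1.0):
    """Gaussian (RBF) kernel between two equal-length vectors (assumed kernel)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def mean_kernel(xs, ys, gamma):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(rbf_kernel(x, y, gamma) for x in xs for y in ys)
    return total / (len(xs) * len(ys))

def mmd_squared(xs, ys, gamma=1.0):
    """Biased estimate of squared MMD between the samples xs and ys."""
    return (mean_kernel(xs, xs, gamma)
            + mean_kernel(ys, ys, gamma)
            - 2.0 * mean_kernel(xs, ys, gamma))

def sample_gaussian(n, dim, mean, rng):
    """Toy stand-in for embeddings: n vectors drawn around a shared mean."""
    return [[rng.gauss(mean, 1.0) for _ in range(dim)] for _ in range(n)]

rng = random.Random(0)
# Hypothetical "common embeddings" and two hypothetical sets of "word embeddings".
embeddings = sample_gaussian(50, 4, 0.0, rng)
word_vectors_near = sample_gaussian(50, 4, 0.0, rng)  # same distribution
word_vectors_far = sample_gaussian(50, 4, 3.0, rng)   # shifted distribution

mmd_near = mmd_squared(embeddings, word_vectors_near, gamma=0.25)
mmd_far = mmd_squared(embeddings, word_vectors_far, gamma=0.25)
print(f"MMD^2 (matched): {mmd_near:.4f}")
print(f"MMD^2 (shifted): {mmd_far:.4f}")
```

In a training setup of the kind the abstract describes, a quantity like this would serve as a loss term that shrinks as the two sets of embeddings become harder to tell apart in distribution. How DADN weights or combines it with the adversarial losses is not stated here.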

Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication Date: Mar 2, 2019
Citations: 103

Similar Papers

Dual Adversarial Networks for Zero-shot Cross-media Retrieval
Jingze Chi ... Yuxin Peng
01 Jul 2018

One-step multi-view spectral clustering by learning common and specific nonnegative embeddings
Hongwei Yin ... Fanzhang Li
International Journal of Machine Learning and Cybernetics | VOL. 12
17 Mar 2021

Zero-Shot Cross-Media Retrieval with External Knowledge
Jingze Chi ... Yuxin Peng
01 Jan 2018

Alleviating Feature Confusion for Generative Zero-shot Learning
Jingjing Li ... Yang Yang
15 Oct 2019
