Zero-Shot Learning via Latent Space Encoding.

Yunlong Yu,Jichang Guo,Zhongfei Zhang,Zhong Ji

doi:10.1109/tcyb.2018.2850750

Abstract

Zero-shot learning (ZSL) is typically achieved by resorting to a class semantic embedding space to transfer the knowledge from the seen classes to unseen ones. Capturing the common semantic characteristics between the visual modality and the class semantic modality (e.g., attributes or word vector) is a key to the success of ZSL. In this paper, we propose a novel encoder-decoder approach, namely latent space encoding (LSE), to connect the semantic relations of different modalities. Instead of requiring a projection function to transfer information across different modalities like most previous work, LSE performs the interactions of different modalities via a feature aware latent space, which is learned in an implicit way. Specifically, different modalities are modeled separately but optimized jointly. For each modality, an encoder-decoder framework is performed to learn a feature aware latent space via jointly maximizing the recoverability of the original space from the latent space and the predictability of the latent space from the original space. To relate different modalities together, their features referring to the same concept are enforced to share the same latent codings. In this way, the common semantic characteristics of different modalities are generalized with the latent representations. Another property of the proposed approach is that it is easily extended to more modalities. Extensive experimental results on four benchmark datasets [animal with attribute, Caltech UCSD birds, aPY, and ImageNet] clearly demonstrate the superiority of the proposed approach on several ZSL tasks, including traditional ZSL, generalized ZSL, and zero-shot retrieval.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Zero-Shot Learning via Latent Space Encoding.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cybernetics

Lead the way for us

Journal: IEEE Transactions on Cybernetics	Publication Date: Jul 16, 2018
Citations: 101

Similar Papers

Modality independent adversarial network for generalized zero shot image classification
Haofeng Zhang ... Ling Shao
Neural Networks | VOL. 134
Haofeng Zhang, et. al.Haofeng Zhang ... Ling Shao
21 Nov 2020
Neural Networks | VOL. 134

Learning an enhanced consensus representation for multi-view clustering via latent representation correlation preserving
Zhongyan Gui ... Zhiqiang Xie
Knowledge-Based Systems | VOL. 253
Zhongyan Gui, et. al.Zhongyan Gui ... Zhiqiang Xie
22 Jul 2022
Knowledge-Based Systems | VOL. 253

An end-to-end deep generative approach with meta-learning optimization for zero-shot object classification
Xiaofeng Xu ... Guifu Lu
Information Processing & Management | VOL. 60
Xiaofeng Xu, et. al.Xiaofeng Xu ... Guifu Lu
16 Dec 2022
Information Processing & Management | VOL. 60

Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.
Fumin Shen ... Xiang Zhou
IEEE Transactions on Image Processing | VOL. 28
Fumin Shen, et. al.Fumin Shen ... Xiang Zhou
18 Feb 2019
IEEE Transactions on Image Processing | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Zero-Shot Learning via Latent Space Encoding.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Cybernetics