Abstract

Zero-shot learning (ZSL) aims to recognize image instances of unseen classes based solely on the semantic descriptions of those classes. Within this field, Generalized Zero-Shot Learning (GZSL) is a challenging problem in which images of both seen and unseen classes appear at test time. Existing methods formulate GZSL as a semantic-visual correspondence problem and apply generative models such as Generative Adversarial Networks and Variational Autoencoders to solve it. However, these methods suffer from a bias problem: images of unseen classes are often misclassified into seen classes. In this work, we propose a novel model, the Dual Projective model for Zero-Shot Learning (DPZSL), which uses text descriptions as the semantic representation. To alleviate the bias problem, we leverage two autoencoders to project the visual and semantic features into a latent space and constrain the embeddings with a visual-semantic correspondence loss. A classifier is further introduced to ensure the discriminability of the embedded features. Our method targets the more challenging inductive ZSL setting, in which only labeled data from seen classes are available during training. Experimental results on two popular datasets, Caltech-UCSD Birds-200-2011 (CUB) and North America Birds (NAB), show that the proposed DPZSL model significantly outperforms existing methods in both the inductive ZSL and GZSL settings. In the GZSL setting in particular, our model improves on the state-of-the-art CANZSL by up to 15.2% on the CUB and NAB datasets under two different splits.
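
The abstract does not include implementation details, but the dual-autoencoder idea can be illustrated with a minimal sketch. The code below assumes a PyTorch implementation; all module names, feature dimensions, loss terms, and their equal weighting are illustrative assumptions rather than the authors' actual architecture. It shows two autoencoders projecting visual and semantic features into a shared latent space, a correspondence loss pulling paired embeddings together, and a classifier on the latent codes to keep them discriminative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class AutoEncoder(nn.Module):
    """Simple MLP autoencoder: input -> latent code -> reconstruction."""

    def __init__(self, in_dim, latent_dim):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, latent_dim), nn.ReLU())
        self.decoder = nn.Linear(latent_dim, in_dim)

    def forward(self, x):
        z = self.encoder(x)
        return z, self.decoder(z)


class DualProjection(nn.Module):
    """Hypothetical sketch of a dual projective model: two autoencoders map visual
    and semantic (text) features into one latent space; a linear classifier keeps
    the latent embeddings discriminative."""

    def __init__(self, vis_dim, sem_dim, latent_dim, num_classes):
        super().__init__()
        self.visual_ae = AutoEncoder(vis_dim, latent_dim)
        self.semantic_ae = AutoEncoder(sem_dim, latent_dim)
        self.classifier = nn.Linear(latent_dim, num_classes)

    def loss(self, vis, sem, labels):
        z_v, vis_rec = self.visual_ae(vis)
        z_s, sem_rec = self.semantic_ae(sem)
        # Reconstruction terms keep each projection information-preserving.
        rec = F.mse_loss(vis_rec, vis) + F.mse_loss(sem_rec, sem)
        # Visual-semantic correspondence: align paired latent embeddings.
        corr = F.mse_loss(z_v, z_s)
        # Classification on the visual embedding enforces discriminability.
        cls = F.cross_entropy(self.classifier(z_v), labels)
        # Equal weighting of the terms is an assumption for this sketch.
        return rec + corr + cls


# Toy usage with hypothetical dimensions (CNN image features and text-feature vectors).
model = DualProjection(vis_dim=2048, sem_dim=7000, latent_dim=256, num_classes=150)
vis = torch.randn(8, 2048)
sem = torch.randn(8, 7000)
labels = torch.randint(0, 150, (8,))
loss = model.loss(vis, sem, labels)
loss.backward()
```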
