Zero-Shot Learning via Discriminative Dual Semantic Auto-Encoder

Nan Xing, Yang Liu, Hong Zhu, Jungong Han, Jing Wang

doi:10.1109/access.2020.3046573

Abstract

Zero-shot learning (ZSL) is an effective method for performing recognition without any training samples of specific classes. Most existing ZSL models emphasize directly learning an embedding between the visual space and the semantic space. However, few ZSL models examine whether the human-designed semantic features are discriminative enough to distinguish different classes. Moreover, one-way mapping suffers from the projection domain shift problem. In this article, we propose a Discriminative Dual Semantic Auto-encoder (DDSA), based on the encoder-decoder paradigm, to address these problems. DDSA constructs two bidirectional embeddings that connect the visual space and the semantic space through a learned aligned space, which contains discriminative information from both the visual features and the semantic features. Building on DDSA, we additionally propose a Deep DDSA to capture deep aligned features that are more conducive to zero-shot classification. The key to the proposed framework is that it implicitly extracts the principal information from the visual space and the semantic space to construct aligned features that are not only semantic-preserving but also discriminative. Extensive experiments on five benchmarks (SUN, CUB, AWA1, AWA2 and aPY) demonstrate the effectiveness of the proposed framework, with state-of-the-art performance obtained in both conventional ZSL and generalized ZSL settings.
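
As a rough illustration of the encoder-decoder paradigm mentioned in the abstract, the sketch below fits a tied-weight linear semantic autoencoder in the style of the Semantic Autoencoder of Kodirov et al. It is not DDSA itself: the abstract does not give DDSA's objective, its aligned space, or its deep variant, so none of those are modeled here. The function names, the `lam` trade-off parameter, and the Kronecker-product solver are assumptions chosen to keep the example small and self-contained.

```python
import numpy as np


def fit_linear_autoencoder(X, S, lam=0.2):
    """Fit a tied-weight linear encoder W of shape (k, d).

    X: visual features, shape (d, N).
    S: semantic vectors of the training samples' classes, shape (k, N).
    Minimizes ||X - W.T @ S||_F^2 + lam * ||W @ X - S||_F^2.
    Setting the gradient to zero gives the Sylvester equation
    A @ W + W @ B = C with the A, B, C defined below.
    """
    k, d = S.shape[0], X.shape[0]
    A = S @ S.T                    # (k, k)
    B = lam * (X @ X.T)            # (d, d)
    C = (1.0 + lam) * (S @ X.T)    # (k, d)
    # Column-major vectorization turns the Sylvester equation into
    # (I_d kron A + B.T kron I_k) vec(W) = vec(C).
    K = np.kron(np.eye(d), A) + np.kron(B.T, np.eye(k))
    w = np.linalg.solve(K, C.flatten(order="F"))
    return w.reshape((k, d), order="F")


def predict_nearest_prototype(W, X_test, prototypes):
    """Encode test features and pick the most similar class prototype.

    X_test: (d, M); prototypes: (k, C), one column per candidate class.
    Returns an array of M column indices into `prototypes`.
    """
    Z = W @ X_test                                          # (k, M)
    Z = Z / (np.linalg.norm(Z, axis=0, keepdims=True) + 1e-12)
    P = prototypes / (np.linalg.norm(prototypes, axis=0, keepdims=True) + 1e-12)
    return np.argmax(P.T @ Z, axis=0)                       # cosine similarity
```

Under these assumptions, conventional ZSL would pass only unseen-class prototypes to `predict_nearest_prototype`, while generalized ZSL would pass the prototypes of seen and unseen classes together. The Kronecker system has `k * d` unknowns, so it only suits small demonstrations; for realistic feature dimensions a dedicated solver such as `scipy.linalg.solve_sylvester` would be the more practical choice.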

Highlights

There are about 30,000 basic object categories, plus subordinate ones, that humans can recognize in the world.
Few zero-shot learning (ZSL) models examine whether the human-designed semantic features are discriminative enough to recognize different classes.
Based on the Discriminative Dual Semantic Auto-encoder (DDSA), we propose a Deep DDSA to capture deep aligned features that are more conducive to zero-shot classification.

Summary

Introduction

There are about 30,000 basic object categories, plus subordinate ones, that humans can recognize in the world. Humans can even recognize new classes dynamically from a few examples with little effort, but this is not easy for computer-based machine learning models, which usually require thousands of labelled samples for training. Motivated by the ability of humans to recognize unseen examples, zero-shot learning (ZSL) has received increasing interest; it aims to make good use of previously learned knowledge to recognize new categories without the need for labelled training data. When test samples may come from both seen and unseen categories, the task is called Generalized Zero-Shot Learning (GZSL). Because seen categories are usually more common than unseen ones in real-world applications, GZSL is more realistic and challenging than ZSL for practical recognition tasks.

Journal: IEEE Access
Publication Date: Jan 1, 2021
Citations: 43
License type: CC BY 4.0

Similar Papers

Semantic Autoencoder for Zero-Shot Learning
Elyor Kodirov ... Tao Xiang
01 Jul 2017

Bidirectional generative transductive zero-shot learning
Xinpeng Li ... Mao Ye
Neural Computing and Applications | VOL. 33
12 Sep 2020

MFF: Multi-modal feature fusion for zero-shot learning
Weipeng Cao ... Xizhao Wang
Neurocomputing | VOL. 510
09 Sep 2022

VS-Boost: Boosting Visual-Semantic Association for Generalized Zero-Shot Learning
Xiaofan Li ... Yachao Zhang
01 Aug 2023
