Generalised Zero-shot Learning with Multi-modal Embedding Spaces

Rafael Felix,Gustavo Carneiro,Michele Sasdelli,Ben Harwood

doi:10.1109/dicta51227.2020.9363405

Abstract

Generalised zero-shot learning (GZSL) methods aim to classify previously seen and unseen visual classes by leveraging the semantic information of those classes. In the context of GZSL, semantic information is non-visual data such as a text description of the seen and unseen classes. Previous GZSL methods have explored transformations between visual and semantic spaces, as well as the learning of a latent joint visual and semantic space. In these methods, even though learning has explored a combination of spaces (i.e., visual, semantic or joint latent space), inference tended to focus on using just one of the spaces. By hypothesising that inference must explore all three spaces, we propose a new GZSL method based on a multimodal classification over visual, semantic and joint latent spaces. Another issue affecting current GZSL methods is the intrinsic bias toward the classification of seen classes - a problem that is usually mitigated by a domain classifier which modulates seen and unseen classification. Our proposed approach replaces the modulated classification by a computationally simpler multidomain classification based on averaging the multi-modal calibrated classifiers from the seen and unseen domains. Experiments on GZSL benchmarks show that our proposed GZSL approach achieves competitive results compared with the state-of-the-art.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generalised Zero-shot Learning with Multi-modal Embedding Spaces

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Generalised Zero-Shot Learning with Domain Classification in a Joint Semantic and Visual Space
Rafael Felix ... Gustavo Carneiro
-
Rafael Felix, et. al.Rafael Felix ... Gustavo Carneiro
01 Dec 2019
01 Dec 2019

Explicit and Latent Topic Representations of Information Spaces in Social Information Retrieval
Christoph Fuchs ... Georg Groh
-
Christoph Fuchs, et. al.Christoph Fuchs ... Georg Groh
01 Sep 2016
01 Sep 2016

Augmentation Network for Generalised Zero-Shot Learning
Rafael Felix ... Michele Sasdelli
-
Rafael Felix, et. al.Rafael Felix ... Michele Sasdelli
01 Jan 2020
01 Jan 2020

Bidirectional generative transductive zero-shot learning
Xinpeng Li ... Mao Ye
Neural Computing and Applications | VOL. 33
Xinpeng Li, et. al.Xinpeng Li ... Mao Ye
12 Sep 2020
Neural Computing and Applications | VOL. 33

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generalised Zero-shot Learning with Multi-modal Embedding Spaces

Abstract

Talk to us

Similar Papers