Scalable Zero-Shot Learning via Binary Visual-Semantic Embeddings.

Fumin Shen, Li Liu, Yang Yang, Heng Tao Shen, Jun Yu, Xiang Zhou

doi:10.1109/tip.2019.2899987

Abstract

Zero-shot learning aims to classify visual instances from unseen classes for which no training examples are available. This is typically achieved by directly mapping visual features to a semantic embedding space of classes (e.g., attributes or word vectors), where the similarity between the two modalities can be readily measured. However, the semantic space may not be reliable for recognition because of noisy class embeddings or the visual bias problem. In this work, we propose a novel Binary embedding based Zero-Shot Learning (BZSL) method, which recognizes visual instances from unseen classes through an intermediate discriminative Hamming space. Specifically, BZSL jointly learns two binary coding functions that encode both visual instances and class embeddings into the Hamming space, which alleviates the visual-semantic bias problem. As a desirable property, an unseen instance can then be classified efficiently by retrieving the class code with minimal Hamming distance to its own code. During training, by introducing two auxiliary variables for the coding functions, we formulate an equivalent correlation maximization problem that admits an analytical solution. The resulting algorithm thus enjoys both highly efficient training and scalable inference over novel classes. Extensive experiments on four benchmark datasets, including the full ImageNet Fall 2011 dataset with over 20K unseen classes, demonstrate the superiority of our method on the zero-shot learning task. In particular, we show that increasing the binary embedding dimension improves the recognition accuracy.
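
The inference step described in the abstract, assigning an instance to the class whose binary code is nearest in Hamming distance, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the learned coding functions, so the 4-bit codes below are hand-written placeholders, and the function name `hamming_nearest_class` is hypothetical.

```python
import numpy as np


def hamming_nearest_class(instance_codes, class_codes):
    """For each instance code, return the index of the class code with
    minimal Hamming distance, together with the full distance matrix."""
    # Compare every instance code with every class code bit by bit;
    # the number of differing bits is the Hamming distance.
    dists = (instance_codes[:, None, :] != class_codes[None, :, :]).sum(axis=2)
    return dists.argmin(axis=1), dists


# Placeholder binary codes (assumption): in BZSL these would come from the
# two learned coding functions, one for class embeddings, one for images.
class_codes = np.array([[0, 0, 0, 0],
                        [1, 1, 1, 1],
                        [1, 0, 1, 0]], dtype=np.uint8)
instance_codes = np.array([[0, 0, 0, 1],
                           [1, 1, 0, 1],
                           [1, 0, 1, 0]], dtype=np.uint8)

pred, dists = hamming_nearest_class(instance_codes, class_codes)
print(pred.tolist())   # [0, 1, 2]
print(dists.tolist())  # [[1, 3, 3], [3, 1, 3], [2, 2, 0]]
```

Because the comparison only involves bit differences, the cost of classifying one instance grows linearly with the number of classes and the code length, which is consistent with the scalability the abstract claims for large numbers of unseen classes.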

Journal: IEEE Transactions on Image Processing
Publication Date: Feb 18, 2019
Citations: 99

Similar Papers

Fusion by synthesizing: A multi-view deep neural network for zero-shot recognition
Xing Xu ... Xuelong Li
Signal Processing | VOL. 164
21 May 2019

Bidirectional generative transductive zero-shot learning
Xinpeng Li ... Mao Ye
Neural Computing and Applications | VOL. 33
12 Sep 2020

An end-to-end deep generative approach with meta-learning optimization for zero-shot object classification
Xiaofeng Xu ... Guifu Lu
Information Processing & Management | VOL. 60
16 Dec 2022

A Novel Perspective to Zero-Shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion
Jingcai Guo ... Song Guo
IEEE Transactions on Multimedia | VOL. 23
03 Apr 2020
