Abstract

Zero-shot learning (ZSL) models typically learn a cross-modal mapping between the visual feature space and the semantic embedding space. Despite the promising performance of existing methods, they usually take visual features extracted from the whole image as input, paying little attention to the image regions that drive human visual response. In this article, we propose a neural network-based ZSL model that incorporates an attention mechanism to discover the discriminative parts of each image. The proposed model automatically generates attention maps over visual parts, providing a flexible way to encode the salient visual aspects that distinguish categories. Moreover, we introduce a simple yet effective objective function that exploits the pairwise label information between images and classes, yielding a substantial performance improvement. When multiple semantic spaces are available, a multiple-attention scheme fuses the different semantic spaces, which leads to further gains. On the widely used CUB-200-2011 data set for fine-grained image classification, we demonstrate the advantages of using the attention mechanism and semantic parts in our model for ZSL. Comprehensive experimental results show that our proposed approach outperforms the state-of-the-art methods.
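The core idea described above can be sketched as follows: attention weights over image regions produce an attended visual feature, which is then projected into the semantic space and scored against a class embedding. This is a minimal NumPy illustration of that general scheme, not the paper's implementation; all names (`attended_score`, `W`, `v`) and shapes are assumptions for exposition.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attended_score(region_feats, W, class_embedding, v):
    """Score one image against one class via region-level attention.

    region_feats:    (R, D) visual features for R image regions.
    W:               (D, K) visual-to-semantic projection (learned in ZSL).
    class_embedding: (K,) semantic embedding of the candidate class.
    v:               (D,) attention scoring vector (learned).
    All symbols are illustrative, not taken from the paper.
    """
    # Attention weights over regions -- the "attention map".
    alpha = softmax(region_feats @ v)   # (R,)
    # Attention-weighted pooling of region features.
    pooled = alpha @ region_feats       # (D,)
    # Compatibility score in the semantic embedding space.
    return float((pooled @ W) @ class_embedding)
```

At inference, an image would be scored against every unseen-class embedding and assigned to the highest-scoring class; the pairwise objective mentioned in the abstract would push matched image/class pairs above mismatched ones during training.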
