Graph attention mechanism with global contextual information for multi-label image recognition

Xiaoxiao Ban,Shijie Guo,Yuanquan Wang,Qilong Wang,Shoujun Zhou,Peihua Li

doi:10.1117/1.jei.30.6.063031

Abstract

Recent works have shown that multi-label image recognition is still a challenging task in computer vision due to the complicatedness and diversity of multi-label images. However, the existing works ignore the co-occurrence correlation and global contextual information between image space and objects. We present a model to solve these problems. On the one hand, we devise the graph attention mechanism to compute the hidden representations of different categories in multi-label images. It can specify different weights to different neighbor objects and well model the label dependency. On the other hand, we iterate the global contextual information by the second-order covariance pooling to enhance nonlinear modeling capability and use basic residual network to extract features. The proposed model is thoroughly evaluated on PASCAL VOC 2007 and MS-COCO datasets. Compared with classical ML-GCN, the model can better combine the image features and label embedding. Meanwhile, experiments show that it outperforms the state-of-the-art methods such as residual multi-layer perceptron, EfficientNet, and vision transformer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Graph attention mechanism with global contextual information for multi-label image recognition

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging

Lead the way for us

Journal: Journal of Electronic Imaging	Publication Date: Dec 28, 2021
Citations: 2

Similar Papers

HDAM: Heuristic Difference Attention Module for Convolutional Neural Networks
Yu Xue ... Ziming Yuan
Journal on Internet of Things | VOL. 4
Yu Xue, et. al.Yu Xue ... Ziming Yuan
01 Jan 2021
Journal on Internet of Things | VOL. 4

GC–HGNN: A global-context supported hypergraph neural network for enhancing session-based recommendation
Dunlu Peng ... Shuo Zhang
Electronic Commerce Research and Applications | VOL. 52
Dunlu Peng, et. al.Dunlu Peng ... Shuo Zhang
01 Mar 2022
Electronic Commerce Research and Applications | VOL. 52

Global context guided hierarchically residual feature refinement network for defocus blur detection
Yongping Zhai ... Chang Tang
Signal Processing | VOL. 183
Yongping Zhai, et. al.Yongping Zhai ... Chang Tang
20 Jan 2021
Signal Processing | VOL. 183

Keyword Extraction Using Support Vector Machine
Kuo Zhang ... Hui Xu
-
Kuo Zhang, et. al.Kuo Zhang ... Hui Xu
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Graph attention mechanism with global contextual information for multi-label image recognition

Abstract

Talk to us

Similar Papers

More From: Journal of Electronic Imaging