Multi-Label Image Classification with Attention Mechanism and Graph Convolutional Networks

Quanling Meng, Weigang Zhang

doi:10.1145/3338533.3366589

Publication Date: Dec 15, 2019

Citations: 21

Affiliation: Harbin Institute of Technology

Abstract

The task of multi-label image classification is to predict the set of labels that apply to an input image. Doing this well requires strengthening the association between labels and image regions and exploiting the relationships among the labels. In this paper, we propose a novel framework for multi-label image classification that uses an attention mechanism and a Graph Convolutional Network (GCN) together. The attention mechanism focuses on specific target regions while ignoring irrelevant surrounding information, thereby enhancing the association of the labels with the image regions. By constructing a directed graph over the labels, the GCN learns the relationships between the labels from a global perspective and maps this label graph to a set of inter-dependent object classifiers. The framework first uses ResNet to extract features, while the attention mechanism generates an attention map for each label and produces weighted features. The GCN then uses a weighted fusion of the ResNet output and the attention-weighted features to perform classification. Experimental results show that both the attention mechanism and the GCN improve classification performance, and that the proposed framework is competitive with state-of-the-art methods.
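
The pipeline described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the abstract does not give the attention formulation, the graph normalisation, or the fusion rule, so everything below is an assumption made for illustration. In particular, `attn_weights`, `label_emb`, `w1`, `w2`, the mixing coefficient `alpha`, the row normalisation of the directed adjacency matrix, and the random arrays standing in for ResNet features are all hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along one axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def label_attention(features, attn_weights):
    """Per-label spatial attention (assumed form).

    features:     (N, D) region features, N = H * W positions.
    attn_weights: (C, D) hypothetical projection, one row per label.
    Returns attention maps (C, N) and attention-weighted features (C, D).
    """
    maps = softmax(attn_weights @ features.T, axis=1)  # each row sums to 1
    return maps, maps @ features


def normalize_adjacency(adj):
    """Add self-loops, then row-normalise a directed label graph (assumed)."""
    a = adj + np.eye(adj.shape[0])
    return a / a.sum(axis=1, keepdims=True)


def gcn_layer(a_hat, h, w, activate=True):
    """One graph convolution: A_hat @ H @ W, with optional ReLU."""
    out = a_hat @ h @ w
    return np.maximum(out, 0.0) if activate else out


def classify(features, attn_weights, adj, label_emb, w1, w2, alpha=0.5):
    """Return per-label probabilities and the attention maps."""
    maps, weighted = label_attention(features, attn_weights)
    global_feat = features.mean(axis=0)                     # (D,) pooled feature
    fused = alpha * weighted + (1.0 - alpha) * global_feat  # (C, D), assumed fusion
    a_hat = normalize_adjacency(adj)
    hidden = gcn_layer(a_hat, label_emb, w1)
    classifiers = gcn_layer(a_hat, hidden, w2, activate=False)  # (C, D)
    logits = (classifiers * fused).sum(axis=1)  # one score per label
    return 1.0 / (1.0 + np.exp(-logits)), maps


# Toy sizes: 7x7 feature map, 8 channels, 4 labels, 6-dim label embeddings.
rng = np.random.default_rng(0)
N, D, C, E, HID = 49, 8, 4, 6, 5
features = rng.standard_normal((N, D))     # stand-in for ResNet output
attn_weights = 0.1 * rng.standard_normal((C, D))
adj = (rng.random((C, C)) > 0.5).astype(float)
np.fill_diagonal(adj, 0.0)                 # directed, so not symmetric in general
label_emb = rng.standard_normal((C, E))
w1 = 0.1 * rng.standard_normal((E, HID))
w2 = 0.1 * rng.standard_normal((HID, D))

probs, maps = classify(features, attn_weights, adj, label_emb, w1, w2)
print(probs.shape, maps.shape)
```

Each label gets its own attention map over the image positions and its own classifier vector from the label graph, so the final score for a label combines where that label looks in the image with what the graph says about related labels. A trained model would learn all of the weight matrices end to end; here they are random, so the printed probabilities carry no meaning.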

Full Text