Abstract
This article presents a simple and intuitive solution for multilabel image classification that achieves competitive performance on the popular COCO and PASCAL VOC benchmarks. The main idea is to capture how humans perform this task: we recognize both labels (i.e., objects and attributes) and the correlations among labels at the same time. Here, label recognition is performed by a standard ConvNet pipeline, whereas label correlation modeling is done by projecting both labels and the image features extracted by the ConvNet into a common latent vector space. Specifically, we carefully design the loss function to ensure that: 1) labels and features that co-appear frequently are close to each other in the latent space and 2) conversely, labels/features that do not appear together are far apart. This information is then combined with the original ConvNet outputs to form the final prediction. The whole model is trained end-to-end, with no supervision other than the image-level labels. Experiments show that the proposed method consistently outperforms previous approaches on COCO and PASCAL VOC in terms of mAP, macro/micro precision, recall, and $F$-measure. Further, our model is highly efficient at test time, adding only a small number of weights to the base label-recognition model.
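The abstract does not spell out the exact loss, but the stated design goals (co-occurring labels and features pulled together in the latent space, non-co-occurring ones pushed apart) can be sketched with a contrastive-style margin loss. The function below is a minimal illustration of that idea, not the paper's actual formulation; the names, the squared-Euclidean distance, and the hinge margin are all assumptions.

```python
import numpy as np

def embedding_loss(feat_emb, label_emb, y, margin=1.0):
    """Contrastive-style loss over a shared latent space (illustrative sketch).

    feat_emb  : (N, d) image features projected into the latent space
    label_emb : (C, d) label embeddings in the same space
    y         : (N, C) binary label matrix (1 = label present in image)
    margin    : minimum desired distance for absent (negative) labels
    """
    # Pairwise squared Euclidean distances: d2[i, c] = ||feat_i - label_c||^2.
    d2 = ((feat_emb[:, None, :] - label_emb[None, :, :]) ** 2).sum(axis=-1)
    # Present labels: penalize being far from the image feature.
    pos = y * d2
    # Absent labels: penalize only if closer than the margin (hinge).
    neg = (1.0 - y) * np.maximum(0.0, margin - np.sqrt(d2)) ** 2
    return (pos + neg).mean()
```

With embeddings arranged as intended, present labels sit near their images and absent labels lie beyond the margin, driving the loss toward zero; a mismatched arrangement yields a strictly larger loss.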
Published in: IEEE Transactions on Systems, Man, and Cybernetics: Systems