DELTA: A deep dual-stream network for multi-label image classification

Wan-Jin Yu,Zhen-Duo Chen,Xin Luo,Wu Liu,Xin-Shun Xu

doi:10.1016/j.patcog.2019.03.006

Abstract

Multi-label image classification problem is one of the most important and fundamental problems in computer vision. In an image with multiple labels, the objects usually locate at various positions with different scales and poses. Moreover, some labels are associated with the entire image instead of a small region. Therefore, both the global and local information are important for classification. To effectively extract and make full use of these information, in this paper, we present a novel deep Dual-stream nEtwork for the muLTi-lAbel image classification task, DELTA for short. As its name indicates, it is composed of two streams, i.e., the Multi-Instance network and the Global Priors network. The former is used to extract the multi-scale class-related local instances features by modeling the classification problem in a multi-instance learning framework. The latter is devised to capture the global priors from the input image as the global information. These two streams are fused by the final fusion layer. In this way, DELTA can extract and make full use of both the global and local information for classification. Extensive experiments on three benchmark datasets, i.e., PASCAL VOC 2007, PASCAL VOC 2012 and Microsoft COCO, demonstrate that DELTA significantly outperforms several state-of-the-art methods. Moreover, DELTA can automatically locate the key image patterns that trigger the labels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DELTA: A deep dual-stream network for multi-label image classification

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Mar 11, 2019
Citations: 48

Similar Papers

Combining local and global hypotheses in deep neural network for multi-label image classification
Qinghua Yu ... Jizhong Zhao
Neurocomputing | VOL. 235
Qinghua Yu, et. al.Qinghua Yu ... Jizhong Zhao
25 Dec 2016
Neurocomputing | VOL. 235

Multiview matrix completion for multilabel image classification.
Yong Luo ... Dacheng Tao
IEEE Transactions on Image Processing | VOL. 24
Yong Luo, et. al.Yong Luo ... Dacheng Tao
09 Apr 2015
IEEE Transactions on Image Processing | VOL. 24

A Deep Learning Model for Multi-label Classification Using Capsule Networks
Diqi Pan ... Peiyu Kang
-
Diqi Pan, et. al.Diqi Pan ... Peiyu Kang
01 Jan 2019
01 Jan 2019

CTransCNN: Combining transformer and CNN in multilabel medical image classification
Xin Wu ... Shuangsheng Zhang
Knowledge-Based Systems | VOL. 281
Xin Wu, et. al.Xin Wu ... Shuangsheng Zhang
30 Sep 2023
Knowledge-Based Systems | VOL. 281

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DELTA: A deep dual-stream network for multi-label image classification

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition