Abstract

Existing attribute learning methods rely on predefined attributes, which require manual annotations. Due to the limitation of human experience, the predefined attributes are not capable enough of providing enough description. This paper proposes a self-supervised attribute learning (SAL) method, which automatically generates attribute descriptions by differentially occluding the object region to deal with the above problems. The relationship between attributes is formulated with triplet loss functions and is utilized to supervise the CNN. Attribute learning is used as an auxiliary task of a multitask image classification and segmentation network, in which self-supervision of attributes motivates the CNN to learn more discriminative features for the main semantic tasks. Experimental results on public benchmarks CUB-2011 and Pascal VOC show that the proposed SAL-Net can obtain more accurate classification and segmentation results without additional annotations. Moreover, the SAL-Net is embedded into a multiobject recognition and segmentation system, which realizes instance-aware semantic segmentation with the help of a region proposal algorithm and a fusion nonmaximum suppression algorithm.

Highlights

  • Visual attributes are designed as midlevel semantic features to describe objects

  • Unlike the aforementioned methods, which learn latent attributes complementary to human-defined attributes for cross-class transfer learning, this paper proposes a self-supervised attribute learning method for object recognition and segmentation, which does not require additional attribute annotations

  • Compared with SAL-Net, the only difference is that such a baseline applies supervision from manually annotated attributes, while SAL-Net is based on the proposed self-supervised attribute learning method

Read more

Summary

Introduction

Visual attributes are designed as midlevel semantic features to describe objects. most of the existing attribute learning methods [1,2,3,4,5,6,7] rely on manually annotated datasets [8,9,10,11]. Based on the proposed method, this paper extends our previous work [27] by replacing the manually annotated supervision with the formulated attribute relationship to improve object recognition and segmentation. The contributions of this paper can be summarized as follows: Wireless Communications and Mobile Computing (1) A self-supervised attribute learning (SAL) method is proposed by automatically generating attribute relationships to learn the CNN (2) We design a multitask deep neural network SALNet, in which attribute learning is an auxiliary selfsupervised task to alleviate intraclass variations in recognition tasks and refine the segmentation results (3) We embed the SAL-Net into an instance-aware semantic segmentation system, in which a Fusion NMS algorithm is proposed to deal with repeated extraction of the same object.

Related Work
Methodology
Experiments
Performance of SAL-Net
Methods
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call