Abstract

Self-supervised learning has been shown to be highly effective to avoid the expensive cost of collecting and annotating large-scale datasets. In this paper, we explore its use in image co-saliency detection task which detects and segments the co-concurrent and conspicuous patterns in one image group, and a self-supervised framework is presented that contains three novel module designs. Specifically, we first design an unsupervised graph clustering algorithm to generate an initial detection result from extracted semantical features of convolutional networks. Then, the post-processed output is taken as the pseudo labels of the second module that learns a neural network for more accurate sample affinity estimation, followed by a manifold ranking based label smoothing. Finally, with the smoothed labels as supervision, we design a saliency network that aggregates a hierarchy of multi-scale features to predict final optimized co-saliency maps. In addition, the proposed framework can be readily extended to image saliency object detection task when giving a single image as input. Extensive evaluations on three co-saliency and five saliency object detection benchmark datasets demonstrate that the proposed framework achieves favorable performance against a variety of supervised and unsupervised state-of-the-arts.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.