Abstract

Salient object detection (SOD) methods typically consider SOD as a pixel-wise binary classification problem and utilize the binary cross-entropy (BCE) loss for optimization. However, the BCE loss ignores the global dependencies between pixels of the entire image, which is important for ensuring the accuracy and integrity of objects. To address this limitation explicitly, contrastive learning is introduced to enhance both the intra-pixel compactness of the foreground and inter-pixel separability between the foreground and background. Unfortunately, the random pixel sampling strategies of existing contrast algorithms cannot ensure that the sampled pixels cover all semantic contents across the entire image because of the lack of fine-grained semantic categories for pixels in SOD. Hence, in this study, a SuperPixel-wise Contrastive (SPCont) algorithm is proposed, which samples all superpixel centers to improve the diversity and representativeness of the sampled pixels. In addition, a SOD-specific hard pixel sampling strategy is designed to refine the edge details of the objects. An intra-image multi-level negative sampling approach is presented to increase the number of negative samples. Extensive experiments are conducted to demonstrate that incorporating the SPCont algorithm into the state-of-the-art network results in significant performance improvements. We expect that our work will open new paths for the application of contrastive learning to SOD.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call