Abstract
Salient object detection (SOD) methods typically consider SOD as a pixel-wise binary classification problem and utilize the binary cross-entropy (BCE) loss for optimization. However, the BCE loss ignores the global dependencies between pixels of the entire image, which is important for ensuring the accuracy and integrity of objects. To address this limitation explicitly, contrastive learning is introduced to enhance both the intra-pixel compactness of the foreground and inter-pixel separability between the foreground and background. Unfortunately, the random pixel sampling strategies of existing contrast algorithms cannot ensure that the sampled pixels cover all semantic contents across the entire image because of the lack of fine-grained semantic categories for pixels in SOD. Hence, in this study, a SuperPixel-wise Contrastive (SPCont) algorithm is proposed, which samples all superpixel centers to improve the diversity and representativeness of the sampled pixels. In addition, a SOD-specific hard pixel sampling strategy is designed to refine the edge details of the objects. An intra-image multi-level negative sampling approach is presented to increase the number of negative samples. Extensive experiments are conducted to demonstrate that incorporating the SPCont algorithm into the state-of-the-art network results in significant performance improvements. We expect that our work will open new paths for the application of contrastive learning to SOD.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.