Abstract

Recently, the performance of salient object detection (SOD) has been significantly improved by utilizing edge information for auxiliary training. However, the extraction and utilization of edge cues and multi-level feature fusion are still two issues in existing edge-aware models. In this paper, we devise a novel SOD network with edge-guided learning and specific aggregation, named ELSA-Net, to cooperatively address these two issues. First, we propose the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">edge-guided learning strategy</i> , which utilizes edge cues as low-level guidance to improve saliency prediction. Specifically, we design a two-stream model that uses a saliency branch and an edge branch to detect the interior and the boundary of salient objects, respectively. Then, an edge-guided interaction module (EGI) is further designed to achieve feature enhancement by embedding edge information into the saliency branch as the spatial weights. In addition, two <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">specific aggregation modules</i> are proposed for the progressive fusion of multi-level features in the above two streams, thus making full use of semantic and detailed information. The high-level interactive fusion module (HIF) leverages the correlation between two deeper features to obtain more powerful global contexts. And the low-level weighted fusion module (LWF) focuses on the complement of fine information by selectively integrating input features. Extensive experiments show that the proposed approach outperforms 19 state-of-the-art methods on five datasets, which validates its effectiveness both quantitatively and qualitatively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call