Attention enhanced machine instinctive vision with human-inspired saliency detection

Habib Khan, Muhammad Talha Usman, Imad Rida, Jakeoung Koo

doi:10.1016/j.imavis.2024.105308

Abstract

Salient object detection (SOD) enables machines to recognize and accurately segment visually prominent regions in images. Despite recent advances, existing approaches often lack progressive fusion of low- and high-level features, effective multi-scale feature handling, and precise boundary detection. Moreover, the robustness of these models under varied lighting conditions remains a concern. To overcome these challenges, we present the Attention Enhanced Machine Instinctive Vision framework for SOD. The proposed framework follows the strategy of the Multi-stage Feature Refinement with Optimal Attentions-Driven Framework (MFRNet). Multi-level features are extracted from six stages of the EfficientNet-B7 backbone, which allows low- and high-level details to be fused effectively across scales in the later part of the framework. We introduce the Spatial-optimized Feature Attention (SOFA) module, which refines spatial features from the three initial-stage feature maps. These multi-scale backbone features are passed through convolutional feature transformation and spatial attention mechanisms to refine the low-level information. The SOFA module then concatenates and upsamples the refined features, producing a comprehensive spatial representation across levels. In addition, the proposed Context-Aware Channel Refinement (CACR) module combines dilated convolutions with optimized dilation rates, followed by channel attention, to capture multi-scale contextual information from the three later (deeper) stages. Furthermore, our progressive feature fusion strategy combines high-level semantic information and low-level spatial details through multiple residual connections, ensuring robust feature representation and effective gradient backpropagation. To enhance robustness, we train our network with augmented data featuring low and high brightness adjustments, improving its ability to handle diverse lighting conditions.
Extensive experiments on four benchmark datasets (ECSSD, HKU-IS, DUTS, and PASCAL-S) validate the effectiveness of the proposed framework, demonstrating superior performance compared to existing state-of-the-art (SOTA) methods in the domain. Code, qualitative results, and trained weights will be available to the research community at https://github.com/habib1402/MFRNet-SOD.
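
The brightness augmentation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `adjust_brightness` and `augment_pair`, the nested-list image layout, and the darkening and brightening factor ranges are all assumptions chosen for illustration. The sketch scales pixel intensities by a random factor and leaves the saliency mask untouched, since a photometric change does not move object boundaries.

```python
import random


def adjust_brightness(image, factor):
    """Scale every pixel by `factor` and clip to the 8-bit range [0, 255].

    `image` is a nested list indexed as [row][col][channel] of ints.
    factor < 1 darkens, factor > 1 brightens. A new image is returned.
    """
    return [
        [[min(255, max(0, round(value * factor))) for value in pixel]
         for pixel in row]
        for row in image
    ]


def augment_pair(image, mask, rng, low=(0.4, 0.8), high=(1.2, 1.6)):
    """Randomly darken or brighten `image`; return (image, mask, factor).

    The `low` and `high` factor ranges are illustrative assumptions,
    not values reported in the paper. The mask is returned unchanged.
    """
    lo, hi = low if rng.random() < 0.5 else high
    factor = rng.uniform(lo, hi)
    return adjust_brightness(image, factor), mask, factor


if __name__ == "__main__":
    rng = random.Random(0)            # fixed seed for a repeatable demo
    image = [[[100, 200, 50]]]        # a single RGB pixel
    mask = [[1]]                      # matching saliency label
    new_image, new_mask, factor = augment_pair(image, mask, rng)
    print(factor, new_image, new_mask)
```

In a real training pipeline the same idea would typically be applied to array or tensor images through an existing augmentation library rather than nested lists.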

More From: Image and Vision Computing

Similar Papers

Salient Object Detection Using Recurrent Guidance Network With Hierarchical Attention Features
Shanmei Lu ... Yongxia Zhang
IEEE Access | VOL. 8
01 Jan 2020

Multi-Guidance CNNs for Salient Object Detection
Shuaixiong Hui ... Qiang Guo
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 19
25 Feb 2023

Graphic image classification method based on an attention mechanism and fusion of multilevel and multiscale deep features
Shan Liu ... Lingling Huang
Computer Communications | VOL. 209
08 Jul 2023

Lightweight multi-level feature difference fusion network for RGB-D-T salient object detection
Kechen Song ... Yunhui Yan
Journal of King Saud University - Computer and Information Sciences | VOL. 35
09 Aug 2023
