Low-level Details Research Articles

Salient object detection (SOD) enables machines to recognize and accurately segment visually prominent regions in images. Despite recent advances, existing approaches often lack progressive fusion of low- and high-level features, effective multi-scale feature handling, and precise boundary detection. Moreover, the robustness of these models under varied lighting conditions remains a concern. To overcome these challenges, we present the Attention Enhanced Machine Instinctive Vision framework for SOD. The proposed framework leverages a Multi-stage Feature Refinement with Optimal Attentions-Driven Framework (MFRNet). Multi-level features are extracted from six stages of the EfficientNet-B7 backbone, which enables effective fusion of low- and high-level details across various scales at the later stages of the framework. We introduce the Spatial-optimized Feature Attention (SOFA) module, which refines spatial features from the three initial-stage feature maps. The multi-scale features extracted from the backbone are passed through convolutional feature transformation and spatial attention mechanisms to refine the low-level information. The SOFA module concatenates and upsamples these refined features, producing a comprehensive spatial representation across levels. Moreover, the proposed Context-Aware Channel Refinement (CACR) module integrates dilated convolutions with optimized dilation rates, followed by channel attention, to capture multi-scale contextual information from the three later (deeper) stages. Furthermore, our progressive feature fusion strategy combines high-level semantic information and low-level spatial details through multiple residual connections, ensuring robust feature representation and effective gradient backpropagation. To enhance robustness, we train our network with augmented data featuring low- and high-brightness adjustments, improving its ability to handle diverse lighting conditions.
Extensive experiments on four benchmark datasets (ECSSD, HKU-IS, DUTS, and PASCAL-S) validate the proposed framework's effectiveness, demonstrating superior performance compared to existing state-of-the-art (SOTA) methods in the domain. Code, qualitative results, and trained weights will be available at https://github.com/habib1402/MFRNet-SOD.
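
The low- and high-brightness augmentation mentioned in the abstract can be illustrated with a minimal sketch. It assumes grayscale images stored as nested lists of floats in [0, 1]; the function names and the factors 0.5 and 1.5 are hypothetical and are not taken from the paper, whose actual augmentation settings are not stated here.

```python
from typing import List, Sequence

# Assumption: a grayscale image is a list of rows with pixel values in [0.0, 1.0].
Image = List[List[float]]


def adjust_brightness(image: Image, factor: float) -> Image:
    """Scale every pixel by `factor` and clip the result to [0.0, 1.0]."""
    return [[min(1.0, max(0.0, pixel * factor)) for pixel in row] for row in image]


def brightness_variants(image: Image, factors: Sequence[float] = (0.5, 1.0, 1.5)) -> List[Image]:
    """Return one copy of `image` per brightness factor (hypothetical factors)."""
    return [adjust_brightness(image, f) for f in factors]


image = [[0.2, 0.8], [0.5, 1.0]]
dark, same, bright = brightness_variants(image)
```

In a segmentation setting, the ground-truth saliency mask would stay unchanged for every variant, since brightness changes do not move object boundaries.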

Multi-organ segmentation of medical images poses several challenges, including complex backgrounds, blurred boundaries between organs, and large differences in organ volume. Because conventional convolution operations have local receptive fields, it is difficult to obtain desirable results by applying them directly to multi-organ segmentation. Transformer-based models capture global information, but their high computational demands make them heavily dependent on hardware. Meanwhile, depthwise convolution with a large kernel can capture global information with lower computational requirements. Therefore, to leverage the large receptive field and reduce model complexity, we propose a novel CNN-based approach, namely adjacent-scale fusion U-Net with large kernel (ASF-LKUNet), for multi-organ segmentation. We use a U-shaped encoder-decoder as the base architecture of ASF-LKUNet. In the encoder path, we design a large kernel residual block that combines large and small kernels and can simultaneously capture global and local features. Furthermore, for the first time, we propose an adjacent-scale fusion and large kernel GRN channel attention that incorporates low-level details with high-level semantics through adjacent-scale features and then adaptively focuses on the more global and meaningful channel information. Extensive experiments and interpretability analyses are conducted on the Synapse multi-organ dataset (Synapse) and the ACDC cardiac multi-structure dataset (ACDC). Our proposed ASF-LKUNet achieves DSC scores of 88.41% and 89.45% on the Synapse and ACDC datasets, respectively, with 17.96M parameters and 29.14 GFLOPs. These results show that our method achieves superior performance with lower complexity than ten competing approaches. Code and the trained models have been released on GitHub.
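
For readers unfamiliar with the DSC metric reported above, the Dice similarity coefficient between a predicted mask A and a ground-truth mask B is 2|A ∩ B| / (|A| + |B|). The following is a minimal sketch for a single binary class, assuming masks stored as nested lists of 0/1 values; the function name and the convention of returning 1.0 when both masks are empty are assumptions, not details from the paper.

```python
from typing import List

# Assumption: a binary mask is a list of rows containing 0 or 1.
Mask = List[List[int]]


def dice_score(pred: Mask, target: Mask) -> float:
    """Dice similarity coefficient: 2 * |A ∩ B| / (|A| + |B|)."""
    pred_flat = [p for row in pred for p in row]
    target_flat = [t for row in target for t in row]
    # Count pixels that are foreground in both masks.
    intersection = sum(1 for p, t in zip(pred_flat, target_flat) if p and t)
    total = sum(pred_flat) + sum(target_flat)
    if total == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return 2.0 * intersection / total


pred = [[1, 1, 0], [0, 1, 0]]
target = [[1, 0, 0], [0, 1, 1]]
score = dice_score(pred, target)  # 2 * 2 / (3 + 3)
```

In multi-organ evaluation, a score of this kind is typically computed per organ and then averaged; whether the paper averages exactly this way is not stated in the abstract.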

Articles published on Low-level Details

GeoDTR+: Toward Generic Cross-View Geolocalization via Geometric Disentanglement.

Building Extraction from Unmanned Aerial Vehicle (UAV) Data in a Landslide-Affected Scattered Mountainous Area Based on Res-Unet

Exergetic port-Hamiltonian systems for multibody dynamics

Attention enhanced machine instinctive vision with human-inspired saliency detection

Feature Enhancement Based Oriented Object Detection in Remote Sensing Images

A Multi-Scale Feature Fusion Deep Learning Network for the Extraction of Cropland Based on Landsat Data

Point-MPP: Point Cloud Self-Supervised Learning From Masked Position Prediction.

Driver Distraction Detection Based on Fusion Enhancement and Global Saliency Optimization

ACU-TransNet: Attention and convolution-augmented UNet-transformer network for polyp segmentation.

Do we need a high level of detail in health information animations? An experimental study investigating the association between level of detail and information recall.

Attention-Guided Sample-Based Feature Enhancement Network for Crowded Pedestrian Detection Using Vision Sensors.

An attentional mechanism model for segmenting multiple lesion regions in the diabetic retina.

Pyecsca: Reverse engineering black-box elliptic curve cryptography via side-channel analysis

Dual cross-enhancement network for highly accurate dichotomous image segmentation

Dual-consistency guidance semi-supervised medical image segmentation with low-level detail feature augmentation

ASF-LKUNet: Adjacent-scale fusion U-Net with large kernel for multi-organ segmentation

Synchronous Programming with Refinement Types

DERE-Net: A dual-encoder residual enhanced U-Net for muscle fiber segmentation of H&E images

Boundary-Aware Gradient Operator Network for Medical Image Segmentation.

Conflicts of interest in the hiring of non-audit services provided by audit firms: are the anti-conflict corporate policies adopted in Brazil adequate?

