Down-sampling Layers Research Articles

Objectives:Accurate extraction of regions of interest (ROI) with variable shapes and scales is one of the primary challenges in medical image segmentation. Current U-based networks mostly aggregate multi-stage encoding outputs as an improved multi-scale skip connection. Although this design has been proven to provide scale diversity and contextual integrity, there remain several intuitive limits: (i) the encoding outputs are resampled to the same size simply, which destruct the fine-grained information. The advantages of utilization of multiple scales are insufficient. (ii) Certain redundant information proportional to the feature dimension size is introduced and causes multi-stage interference. And (iii) the precision of information delivery relies on the up-sampling and down-sampling layers, but guidance on maintaining consistency in feature locations and trends between them is lacking. Methods:To improve these situations, this paper proposed a U-based CNN network named HAD-Net, by assembling a new hyper-scale shifted aggregating module (HSAM) paradigm and progressive reusing attention (PRA) for skip connections, as well as employing a novel pair of dual-branch parameter-free sampling layers, i.e. max-diagonal pooling (MDP) and max-diagonal un-pooling (MDUP). That is, the aggregating scheme additionally combines five subregions with certain offsets in the shallower stage. Since the lower scale-down ratios of subregions enrich scales and fine-grain context. Then, the attention scheme contains a partial-to-global channel attention (PGCA) and a multi-scale reusing spatial attention (MRSA), it builds reusing connections internally and adjusts the focus on more useful dimensions. Finally, MDP and MDUP are explored in pairs to improve texture delivery and feature consistency, enhancing information retention and avoiding positional confusion. Results:Compared to state-of-the-art networks, HAD-Net has achieved comparable and even better performances with Dice of 90.13%, 81.51%, and 75.43% for each class on BraTS20, 89.59% Dice and 98.56% AUC on Kvasir-SEG, as well as 82.17% Dice and 98.05% AUC on DRIVE. Conclusions:The scheme of HSAM+PRA+MDP+MDUP has been proven to be a remarkable improvement and leaves room for further research.

Read full abstract

Synthetic Aperture Radar (SAR) is a high-resolution imaging sensor commonly mounted on platforms such as airplanes and satellites for widespread use. In complex electromagnetic environments, radio frequency interference (RFI) severely degrades the quality of SAR images due to its widely varying bandwidth and numerous unknown emission sources. Although traditional deep learning-based methods have achieved remarkable results by directly processing SAR images as visual ones, there is still considerable room for improvement in their performance due to the wide coverage and high intensity of RFI. To address these issues, this paper proposes the fusion of segmentation and inpainting networks (FuSINet) to suppress SAR RFI in the time-frequency domain. Firstly, to weaken the dominance of RFI in SAR images caused by high-intensity interference, a simple CCN-based network is employed to learn and segment the RFI. This results in the removal of most of the original interference, leaving blanks that allow the targets to regain dominance in the overall image. Secondly, considering the wide coverage characteristic of RFI, a U-former network with global information capture capabilities is utilized to learn the content covered by the interference and fill in the blanks created by the segmentation network. Compared to the traditional Transformer, this paper enhances its global information capture capabilities through shift-windows and down-sampling layers. Finally, the segmentation and inpainting networks are fused together through a weighted parameter for joint training. This not only accelerates the learning speed but also enables better coordination between the two networks, leading to improved RFI suppression performance. Extensive experimental results demonstrate the substantial performance enhancement of the proposed FuSINet. Compared to the PISNet+, the proposed attention mechanism achieves a 2.49 dB improvement in peak signal-to-noise ratio (PSNR). Furthermore, compared to Uformer, the FuSINet achieves an additional 4.16 dB improvement in PSNR.

Read full abstract

Down-sampling Layers Research Articles

Related Topics

Articles published on Down-sampling Layers

HAD-Net: An attention U-based network with hyper-scale shifted aggregating and max-diagonal sampling for medical image segmentation

Vision foundation model for agricultural applications with efficient layer aggregation network

The weighted multi-scale connections networks for macrodispersivity estimation

Efficient Mixed-Type Wafer Defect Pattern Recognition Based on Light-Weight Neural Network.

SPMUNet: Semantic segmentation of citrus surface defects driven by superpixel feature

MultiTumor Analyzer (MTA-20–55): A network for efficient classification of detected brain tumors from MRI images

Synthetic Aperture Radar Radio Frequency Interference Suppression Method Based on Fusing Segmentation and Inpainting Networks

Real-time and lightweight detection of grape diseases based on Fusion Transformer YOLO.

ACANet: A Fine-grained Image Classification Optimization Method Based on Convolution and Attention Fusion

AnomalySeg: Deep Learning-Based Fast Anomaly Segmentation Approach for Surface Defect Detection

EDTNet: A spatial aware attention-based transformer for the pulmonary nodule segmentation.

Graphformer: Adaptive graph correlation transformer for multivariate long sequence time series forecasting

Segmentation of lung nodules based on a refined segmentation network.

YOLO-SDLUWD: YOLOv7-based small target detection network for infrared images in complex backgrounds

SliceSamp: A Promising Downsampling Alternative for Retaining Information in a Neural Network

Blueprint Separable Subsampling and Aggregate Feature Conformer-Based End-to-End Neural Diarization

Efficient Distributed Mapping-Based Computation for Convolutional Neural Networks in Multi-Core Embedded Parallel Environment

From coarse to fine: a deep 3D probability volume contours framework for tumour segmentation and dose painting in PET images.

A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer.

Enhanced YOLOv5 Object Detection Algorithm for Accurate Detection of Adult Rhynchophorus ferrugineus.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Down-sampling Layers Research Articles

Related Topics

Articles published on Down-sampling Layers

HAD-Net: An attention U-based network with hyper-scale shifted aggregating and max-diagonal sampling for medical image segmentation

Vision foundation model for agricultural applications with efficient layer aggregation network

The weighted multi-scale connections networks for macrodispersivity estimation

Efficient Mixed-Type Wafer Defect Pattern Recognition Based on Light-Weight Neural Network.

SPMUNet: Semantic segmentation of citrus surface defects driven by superpixel feature

MultiTumor Analyzer (MTA-20–55): A network for efficient classification of detected brain tumors from MRI images

Synthetic Aperture Radar Radio Frequency Interference Suppression Method Based on Fusing Segmentation and Inpainting Networks

Real-time and lightweight detection of grape diseases based on Fusion Transformer YOLO.

ACANet: A Fine-grained Image Classification Optimization Method Based on Convolution and Attention Fusion

AnomalySeg: Deep Learning-Based Fast Anomaly Segmentation Approach for Surface Defect Detection

EDTNet: A spatial aware attention-based transformer for the pulmonary nodule segmentation.

Graphformer: Adaptive graph correlation transformer for multivariate long sequence time series forecasting

Segmentation of lung nodules based on a refined segmentation network.

YOLO-SDLUWD: YOLOv7-based small target detection network for infrared images in complex backgrounds

SliceSamp: A Promising Downsampling Alternative for Retaining Information in a Neural Network

Blueprint Separable Subsampling and Aggregate Feature Conformer-Based End-to-End Neural Diarization

Efficient Distributed Mapping-Based Computation for Convolutional Neural Networks in Multi-Core Embedded Parallel Environment

From coarse to fine: a deep 3D probability volume contours framework for tumour segmentation and dose painting in PET images.

A Novel Deep Learning Model for Medical Image Segmentation with Convolutional Neural Network and Transformer.

Enhanced YOLOv5 Object Detection Algorithm for Accurate Detection of Adult Rhynchophorus ferrugineus.