Low-resolution Feature Map Research Articles

Underwater object detection plays a significant role in marine ecosystem research and marine species conservation. The improvement of related technologies holds practical significance. Although existing object-detection algorithms have achieved an excellent performance on land, they are not satisfactory in underwater scenarios due to two limitations: the underwater objects are often small, densely distributed, and prone to occlusion characteristics, and underwater embedded devices have limited storage and computational capabilities. In this paper, we propose a high-precision, lightweight underwater detector specifically optimizing for underwater scenarios based on the You Only Look Once Version 8 (YOLOv8) model. Firstly, we replace the Darknet-53 backbone of YOLOv8s with FasterNet-T0, reducing model parameters by 22.52%, FLOPS by 23.59%, and model size by 22.73%, achieving model lightweighting. Secondly, we add a Prediction Head for Small Objects, increase the number of channels for high-resolution feature map detection heads, and decrease the number of channels for low-resolution feature map detection heads. This results in a 1.2% improvement in small-object detection accuracy, while the remaining model parameters and memory consumption are nearly unchanged. Thirdly, we use Deformable ConvNets and Coordinate Attention in the neck part to enhance the accuracy in the detection of irregularly shaped and densely occluded small targets. This is achieved by learning convolution offsets from feature maps and emphasizing the regions of interest (RoIs). Our method achieves 52.12% AP on the underwater dataset UTDAC2020, with only 8.5 M parameters, 25.5 B FLOPS, and 17 MB model size. It surpasses the performance of large model YOLOv8l, at 51.69% AP, with 43.6 M parameters, 164.8 B FLOPS, and 84 MB model size. Furthermore, by increasing the input image resolution to 1280 × 1280 pixels, our model achieves 53.18% AP, making it the state-of-the-art (SOTA) model for the UTDAC2020 underwater dataset. Additionally, we achieve 84.4% mAP on the Pascal VOC dataset, with a substantial reduction in model parameters compared to previous, well-established detectors. The experimental results demonstrate that our proposed lightweight method retains effectiveness on underwater datasets and can be generalized to common datasets.

Read full abstract

Background and objectiveMagnetic Resonance Image (MRI) analysis can provide anatomical examination of internal organs, which is helpful for diagnosis of the disease. Aiming at the problems of insufficient feature information mining in the process of MRI super-resolution (SR) reconstruction, the difficulty of determining the interdependence between the channels of the feature map, and the reconstruction error when reconstructing high-resolution (HR) images, we propose a SR method to solve these problems. MethodsIn this work, we propose a gradual back-projection residual attention network for MRI super-resolution (GRAN), which outperforms most of the state-of-the-art methods. Firstly, we use the gradual upsampling method to gradually scale the low-resolution (LR) image to a given magnification to alleviate the high-frequency information loss caused by the upsampling process. Secondly, we merge the idea of iterative back-projection at each stage of gradual upsampling, learn the mapping relationship between HR and LR feature maps and reduce the noise introduced during the upsampling process. Finally, we use the attention mechanism to dynamically allocate attention resources to the feature maps generated at different stages of the gradual back-projection network, so that the network model can learn the interdependence between each feature map. ResultsFor the 2 × and 4 × enlargement, the proposed GRAN method shows the superiority over the state-of-the-art methods on the Set5, Set14, and Urban100 benchmark datasets, extensive benchmark experiment and analysis show that the superiority of the GRAN algorithm in terms of peak signal-to-noise ratio and structural similarity index indicators. ConclusionThe MRI results reconstructed by gradual back-projection residual attention network on the public dataset IDI have good image sharpness, rich texture details and good visual experience. In addition, the reconstructed image is the closest to the real image, enabling the medical expert to see the biological tissue structure and its early pathological changes more clearly, providing assistance and support to the medical expert in the diagnosis and treatment of the disease.

Read full abstract

Low-resolution Feature Map Research Articles

Related Topics

Articles published on Low-resolution Feature Map

Lightweight medical image segmentation network with multi-scale feature-guided fusion

An enhanced approach for few-shot segmentation via smooth downsampling mask and label smoothing loss

Attentional decoder networks for chest X-ray image recognition on high-resolution features

Efficient Small-Object Detection in Underwater Images Using the Enhanced YOLOv8 Network

Gradient-guided hierarchical feature attack for object detector

Coarse-to-fine matching via cross fusion of satellite images

HS-YOLO: Small Object Detection for Power Operation Scenarios

SADENet: A supervised attention delicate enhanced network for subtle person detection

SSP-Net: Scalable sequential pyramid networks for real-Time 3D human pose regression

High-Resolution Swin Transformer for Automatic Medical Image Segmentation.

Progressive Transformer Machine for Natural Character Reenactment

Fast-HBNet: Hybrid Branch Network for Fast Lane Detection

Balanced Spatial Feature Distillation and Pyramid Attention Network for Lightweight Image Super-resolution

Deep feature pyramid network for EEG emotion recognition

Multi-Scale Semi-Coupled Convolutional Sparse Coding for the Super-Resolution Reconstruction of Remote Sensing Image

Concrete crack segmentation based on UAV-enabled edge computing

Multi-attention augmented network for single image super-resolution

Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild

Gradual back-projection residual attention network for magnetic resonance image super-resolution

3D axial-attention for lung nodule classification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Low-resolution Feature Map Research Articles

Related Topics

Articles published on Low-resolution Feature Map

Lightweight medical image segmentation network with multi-scale feature-guided fusion

An enhanced approach for few-shot segmentation via smooth downsampling mask and label smoothing loss

Attentional decoder networks for chest X-ray image recognition on high-resolution features

Efficient Small-Object Detection in Underwater Images Using the Enhanced YOLOv8 Network

Gradient-guided hierarchical feature attack for object detector

Coarse-to-fine matching via cross fusion of satellite images

HS-YOLO: Small Object Detection for Power Operation Scenarios

SADENet: A supervised attention delicate enhanced network for subtle person detection

SSP-Net: Scalable sequential pyramid networks for real-Time 3D human pose regression

High-Resolution Swin Transformer for Automatic Medical Image Segmentation.

Progressive Transformer Machine for Natural Character Reenactment

Fast-HBNet: Hybrid Branch Network for Fast Lane Detection

Balanced Spatial Feature Distillation and Pyramid Attention Network for Lightweight Image Super-resolution

Deep feature pyramid network for EEG emotion recognition

Multi-Scale Semi-Coupled Convolutional Sparse Coding for the Super-Resolution Reconstruction of Remote Sensing Image

Concrete crack segmentation based on UAV-enabled edge computing

Multi-attention augmented network for single image super-resolution

Pixel-in-Pixel Net: Towards Efficient Facial Landmark Detection in the Wild

Gradual back-projection residual attention network for magnetic resonance image super-resolution

3D axial-attention for lung nodule classification