Feature Information Research Articles

Strip steel is extensively utilized in industries such as automotive manufacturing and aerospace due to its superior machinability, economic benefits, and adaptability. However, defects on the surface of steel strips, such as inclusions, patches, and scratches, significantly affect the performance and service life of the product. Therefore, the salient object detection of surface defects on strip steel is crucial to ensure the quality of the final product. Many factors, such as the low contrast of surface defects on strip steel, the diversity of defect types, complex texture structures, and irregular defect distribution, hinder existing detection technologies from accurately identifying and segmenting defect areas against complex backgrounds. To address the above problems, we propose a novel detector called S3D-SOD for the salient object detection of strip steel surface defects. For the encoding stage, a residual self-attention block is proposed to explore semantic information cues of high-level features to locate and guide low-level feature information. In addition, we apply a general residual channel and spatial attention to low-level features, enabling the model to adaptively focus on the key channels and spatial areas of feature maps with high resolutions, thereby enhancing the encoder features and accelerating the convergence of the model. For the decoding stage, a simple residual decoder block with an upsampling operation is proposed to realize the integration and interaction of feature information between different layers. Here, the simple residual decoder block is used for feature integration due to the following observation: backbone networks like ResNet and the Swin Transformer, after being pretrained on the large dataset ImageNet and then fine-tuned on a smaller dataset for strip steel surface defects, are capable of extracting feature maps that contain both general image features and the specific characteristics required for the salient object detection of strip steel surface defects. The experimental results on the SD-saliency-900 dataset show that S3D-SOD is better than advanced methods, and it has strong generalization ability and robustness.

Read full abstract

While Transformer-based approaches have recently achieved notable success in super-resolution, their extensive computational requirements impede widespread practical adoption. High-resolution meteorological satellite cloud imagery is essential for weather analysis and forecasting. Enhancing image resolution through super-resolution techniques facilitates the accurate identification and localization of geographic features by meteorological systems. However, current super-resolution methods fail to restore the intricacies of cloud formations and complex regions fully. This research introduces a novel dual-path aggregation Transformer network (DPAT) tailored to enhance the super-resolution of meteorological satellite cloud images. The DPAT network adeptly captures cloud imagery's subtle details and textures, effectively addressing occlusions and the variability inherent in satellite imagery. It bolsters the model's ability to manage the complex attributes of cloud images through the introduction of the Dual-path Aggregation Self-Attention (DASA) mechanism and the Multi-scale Feature Aggregation Block (MFAB), thereby enhancing performance in processing intricate cloud features. The DASA mechanism synthesizes features across spatial, depth, and channel dimensions via a dual-path approach, thoroughly exploiting feature correlations. The MFAB, designed to supplant the multilayer perceptron, incorporates shift convolution and a multi-scale interaction block to augment feature information, compensating for the deficiency in local information absorption due to fixed receptive fields. Experimental outcomes indicate that DPAT delivers superior super-resolution outcomes. With a parameter count of only 32% of the Enhanced Deep Residual Network (EDSR) or 77% of the Image Restoration using Shift Window Transformer (SwinIR), DPAT matches SwinIR's performance on the satellite cloud dataset. Moreover, DPAT balances accuracy and parameter economy across various datasets. This technology is expected to improve image super-resolution capabilities in multiple fields such as human action recognition and industrial recognition, and indirectly improve the accuracy of image perception tasks.

Read full abstract

Feature Information Research Articles

Related Topics

Articles published on Feature Information

Self-supervised multimodal change detection based on difference contrast learning for remote sensing imagery

Gesture Recognition with Adaptive-Weight-Based Residual MultiheadCrossAttention Fusion Based on Multi-Level Feature Information

Combining multi-level feature extraction algorithm with residual graph convolutional neural network for partial discharge detection

DSIFNet: Implicit Feature Network for Nasal Cavity and Vestibule Segmentation from 3D Head CT

Enhancing physician support in pancreatic cancer diagnosis: New M-F-RCNN artificial intelligence model using endoscopic ultrasound.

A hybrid CNN-transformer network: Accurate and efficient semantic segmentation of crops and weeds on resource-constrained embedded devices

TDOcc: Exploit machine learning and big data in multi-view 3D occupancy prediction

A Strip Steel Surface Defect Salient Object Detection Based on Channel, Spatial and Self-Attention Mechanisms

Automatic Apple Detection and Counting with AD-YOLO and MR-SORT.

Dual-path aggregation transformer network for super-resolution with images occlusions and variability

Hierarchical-Concatenate Fusion TDNN for sound event classification.

Segmentation-Based Detection for Luffa Seedling Grading Using the Seg-FL Model

A graph convolutional neural network model based on fused multi-subgraph as input and fused feature information as output

TLCSFI: A Pose-Guided Person Re-Identification Method with Two-Level Channel–Spatial Feature Integration

Lightweight and efficient deep learning models for fruit detection in orchards

MS-YOLO: A Lightweight and High-Precision YOLO Model for Drowning Detection.

An improved YOLOv8n-IRP model for natural rubber tree tapping surface detection and tapping key point positioning.

Multi-Dimensional Fuzzy Clustering-Based Trajectory Initialization Algorithm for Infrared Weak Target Trajectories in Robust Clutter Environments

A Novel RUL-Centric Data Augmentation Method for Predicting the Remaining Useful Life of Bearings

Small-sample cucumber disease identification based on multimodal self-supervised learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Feature Information Research Articles

Related Topics

Articles published on Feature Information

Self-supervised multimodal change detection based on difference contrast learning for remote sensing imagery

Gesture Recognition with Adaptive-Weight-Based Residual MultiheadCrossAttention Fusion Based on Multi-Level Feature Information

Combining multi-level feature extraction algorithm with residual graph convolutional neural network for partial discharge detection

DSIFNet: Implicit Feature Network for Nasal Cavity and Vestibule Segmentation from 3D Head CT

Enhancing physician support in pancreatic cancer diagnosis: New M-F-RCNN artificial intelligence model using endoscopic ultrasound.

A hybrid CNN-transformer network: Accurate and efficient semantic segmentation of crops and weeds on resource-constrained embedded devices

TDOcc: Exploit machine learning and big data in multi-view 3D occupancy prediction

A Strip Steel Surface Defect Salient Object Detection Based on Channel, Spatial and Self-Attention Mechanisms

Automatic Apple Detection and Counting with AD-YOLO and MR-SORT.

Dual-path aggregation transformer network for super-resolution with images occlusions and variability

Hierarchical-Concatenate Fusion TDNN for sound event classification.

Segmentation-Based Detection for Luffa Seedling Grading Using the Seg-FL Model

A graph convolutional neural network model based on fused multi-subgraph as input and fused feature information as output

TLCSFI: A Pose-Guided Person Re-Identification Method with Two-Level Channel–Spatial Feature Integration

Lightweight and efficient deep learning models for fruit detection in orchards

MS-YOLO: A Lightweight and High-Precision YOLO Model for Drowning Detection.

An improved YOLOv8n-IRP model for natural rubber tree tapping surface detection and tapping key point positioning.

Multi-Dimensional Fuzzy Clustering-Based Trajectory Initialization Algorithm for Infrared Weak Target Trajectories in Robust Clutter Environments

A Novel RUL-Centric Data Augmentation Method for Predicting the Remaining Useful Life of Bearings

Small-sample cucumber disease identification based on multimodal self-supervised learning