Feature Fusion Module Research Articles

In scenarios such as nearshore and inland waterways, the ship spots in a marine radar are easily confused with reefs and shorelines, leading to difficulties in ship identification. In such settings, the conventional ARPA method based on fractal detection and filter tracking performs relatively poorly. To accurately identify radar targets in such scenarios, a novel algorithm, namely YOSMR, based on the deep convolutional network, is proposed. The YOSMR uses the MobileNetV3(Large) network to extract ship imaging data of diverse depths and acquire feature data of various ships. Meanwhile, taking into account the issue of feature suppression for small-scale targets in algorithms composed of deep convolutional networks, the feature fusion module known as PANet has been subject to a lightweight reconstruction leveraging depthwise separable convolutions to enhance the extraction of salient features for small-scale ships while reducing model parameters and computational complexity to mitigate overfitting problems. To enhance the scale invariance of convolutional features, the feature extraction backbone is followed by an SPP module, which employs a design of four max-pooling constructs to preserve the prominent ship features within the feature representations. In the prediction head, the Cluster-NMS method and α-DIoU function are used to optimize non-maximum suppression (NMS) and positioning loss of prediction boxes, improving the accuracy and convergence speed of the algorithm. The experiments showed that the recall, accuracy, and precision of YOSMR reached 0.9308, 0.9204, and 0.9215, respectively. The identification efficacy of this algorithm exceeds that of various YOLO algorithms and other lightweight algorithms. In addition, the parameter size and calculational consumption were controlled to only 12.4 M and 8.63 G, respectively, exhibiting an 80.18% and 86.9% decrease compared to the standard YOLO model. As a result, the YOSMR displays a substantial advantage in terms of convolutional computation. Hence, the algorithm achieves an accurate identification of ships with different trail features and various scenes in marine radar images, especially in different interference and extreme scenarios, showing good robustness and applicability.

Read full abstract

Abstract Industrial surface defect detection is an important part of industrial production, which aims to identify and detecting various defects on the surface of product to ensure quality and meet customer requirements. With the development of deep learning and image processing technologies, the surface defect detection methods based on computer vision has become the mainstream method. However, the prevalent convolutional neural network-based defect detection methods also have many problems. For example, these methods rely on post-processing of Non-Maximum Suppression and have poor detection ability for small targets, which affects the speed and accuracy of surface defect detection in industrial scenarios. Therefore, we propose a novel DEtection TRansformer-based surface defect detection method. Firstly, we propose a Multi-scale Contextual Information Dilated module and fuse it into the backbone. The module is mainly composed of large kernel convolutions, which aims to expand the receptive field of the model, thus reducing the leakage rate of the model. Moreover, we design an efficient encoder which mainly contains two important modules, namely feature enhancement based on cascaded group attention module and efficient feature fusion module based on content-aware. The former module effectively enhances the high-level semantic information extracted by the backbone, thus enabling the model to better interpret features, and it can improve the problem of high computational cost of transformer encoder, thus increasing the detection speed. The latter module performs multi-scale feature fusion across the feature information of various scales, thus improving the detection accuracy of the model for small-size defects. Experimental results show that the proposed method achieves 80.6%mAP and 80.3FPS on NEU-DET, and 98.0%mAP and 79.4FPS on PCB-DET. Our proposed method exhibits excellent detection performance and achieves real-time and efficient surface defect detection capability to meet the needs of industrial surface defect detection.

Read full abstract

Feature Fusion Module Research Articles

Related Topics

Articles published on Feature Fusion Module

Human pose estimation in complex background videos via Transformer-based multi-scale feature integration

DEAF-Net: Detail-Enhanced Attention Feature Fusion Network for Retinal Vessel Segmentation.

YOSMR: A Ship Detection Method for Marine Radar Based on Customized Lightweight Convolutional Networks

TSEDNet:Task-specific encoder–decoder network for surface defects of strip steel

Direction-aware multi-branch attention and Gaussian label assignment for remote sensing aggregative object detection

A multi-scale attributes fusion model for travel mode identification using GPS trajectories

PSO-based lightweight neural architecture search for object detection

Global and local complementary multi-path feature fusion network for the classification of crop remote sensing images

Hypergraph clustering based multi-label cross-modal retrieval

A lightweight speech enhancement network fusing bone- and air-conducted speech.

BiSTNet: Semantic Image Prior Guided Bidirectional Temporal Feature Fusion for Deep Exemplar-Based Video Colorization.

Ancient paintings inpainting based on dual encoders and contextual information

LWSDNet: A Lightweight Wheat Scab Detection Network Based on UAV Remote Sensing Images

MAA-YOLOv8: enhanced steel surface defect detection through multi-head attention mechanism and lightweight feature fusion

Hybrid modeling for vehicle lateral dynamics via AGRU with a dual-attention mechanism under limited data

Self-Supervised Monocular Depth Estimation for Endoscopic Imaging.

A duplex transform heterogeneous feature fusion network for road segmentation

SGDFormer: One-stage transformer-based architecture for cross-spectral stereo image guided denoising

A method of dense point cloud SLAM based on improved YOLOV8 and fused with ORB-SLAM3 to cope with dynamic environments

REDef-DETR: real-time and efficient DETR for industrial surface defect detection

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Feature Fusion Module Research Articles

Related Topics

Articles published on Feature Fusion Module

Human pose estimation in complex background videos via Transformer-based multi-scale feature integration

DEAF-Net: Detail-Enhanced Attention Feature Fusion Network for Retinal Vessel Segmentation.

YOSMR: A Ship Detection Method for Marine Radar Based on Customized Lightweight Convolutional Networks

TSEDNet:Task-specific encoder–decoder network for surface defects of strip steel

Direction-aware multi-branch attention and Gaussian label assignment for remote sensing aggregative object detection

A multi-scale attributes fusion model for travel mode identification using GPS trajectories

PSO-based lightweight neural architecture search for object detection

Global and local complementary multi-path feature fusion network for the classification of crop remote sensing images

Hypergraph clustering based multi-label cross-modal retrieval

A lightweight speech enhancement network fusing bone- and air-conducted speech.

BiSTNet: Semantic Image Prior Guided Bidirectional Temporal Feature Fusion for Deep Exemplar-Based Video Colorization.

Ancient paintings inpainting based on dual encoders and contextual information

LWSDNet: A Lightweight Wheat Scab Detection Network Based on UAV Remote Sensing Images

MAA-YOLOv8: enhanced steel surface defect detection through multi-head attention mechanism and lightweight feature fusion

Hybrid modeling for vehicle lateral dynamics via AGRU with a dual-attention mechanism under limited data

Self-Supervised Monocular Depth Estimation for Endoscopic Imaging.

A duplex transform heterogeneous feature fusion network for road segmentation

SGDFormer: One-stage transformer-based architecture for cross-spectral stereo image guided denoising

A method of dense point cloud SLAM based on improved YOLOV8 and fused with ORB-SLAM3 to cope with dynamic environments

REDef-DETR: real-time and efficient DETR for industrial surface defect detection