Spatial Feature Fusion Research Articles

Forest fires occur frequently around the world, causing serious economic losses and human casualties. Deep learning techniques based on convolutional neural networks (CNN) are widely used in the intelligent detection of forest fires. However, CNN-based forest fire target detection models lack global modeling capabilities and cannot fully extract global and contextual information about forest fire targets. CNNs also pay insufficient attention to forest fires and are vulnerable to the interference of invalid features similar to forest fires, resulting in low accuracy of fire detection. In addition, CNN-based forest fire target detection models require a large number of labeled datasets. Manual annotation is often used to annotate the huge amount of forest fire datasets; however, this takes a lot of time. To address these problems, this paper proposes a forest fire detection model, TCA-YOLO, with YOLOv5 as the basic framework. Firstly, we combine the Transformer encoder with its powerful global modeling capability and self-attention mechanism with CNN as a feature extraction network to enhance the extraction of global information on forest fire targets. Secondly, in order to enhance the model’s focus on forest fire targets, we integrate the Coordinate Attention (CA) mechanism. CA not only acquires inter-channel information but also considers direction-related location information, which helps the model to better locate and identify forest fire targets. Integrated adaptively spatial feature fusion (ASFF) technology allows the model to automatically filter out useless information from other layers and efficiently fuse features to suppress the interference of complex backgrounds in the forest area for detection. Finally, semi-supervised learning is used to save a large amount of manual labeling effort. The experimental results show that the average accuracy of TCA-YOLO improves by 5.3 compared with the unimproved YOLOv5. TCA-YOLO also outperformed in detecting forest fire targets in different scenarios. The ability of TCA-YOLO to extract global information on forest fire targets was much improved. Additionally, it could locate forest fire targets more accurately. TCA-YOLO misses fewer forest fire targets and is less likely to be interfered with by forest fire-like targets. TCA-YOLO is also more focused on forest fire targets and better at small-target forest fire detection. FPS reaches 53.7, which means that the detection speed meets the requirements of real-time forest fire detection.

Object detection in drone-captured scenarios is a recent popular task. Due to the high flight altitude of unmanned aerial vehicle (UAV), the large variation of target scales, and the existence of dense occlusion of targets, in addition to the high requirements for real-time detection. To solve the above problems, we propose a real-time UAV small target detection algorithm based on improved ASFF-YOLOv5s. Based on the original YOLOv5s algorithm, the new shallow feature map is passed into the feature fusion network through multi-scale feature fusion to improve the extraction capability for small target features, and the Adaptively Spatial Feature Fusion (ASFF) is improved to improve the multi-scale information fusion capability. To obtain anchor frames for the VisDrone2021 dataset, we improve the K-means algorithm to obtain four different scales of anchor frames on each prediction layer. The Convolutional Block Attention Module (CBAM) is added in front of the backbone network and each prediction network layer to improve the capture capability of important features and suppress redundant features. Finally, to address the shortcomings of the original GIoU loss function, the SIoU loss function is used to accelerate the convergence of the model and improve accuracy. Extensive experiments conducted on the dataset VisDrone2021 show that the proposed model can detect a wide range of small targets in various challenging environments. At a detection rate of 70.4 FPS, the proposed model obtained a precision value of 32.55%, F1-score of 39.62%, and a mAP value of 38.03%, which improved 2.77, 3.98, and 5.1%, respectively, compared with the original algorithm, for the detection performance of small targets and to meet the task of real-time detection of UAV aerial images. The current work provides an effective method for real-time detection of small targets in UAV aerial photography in complex scenes, and can be extended to detect pedestrians, cars, etc. in urban security surveillance.

Spatial Feature Fusion Research Articles

Related Topics

Articles published on Spatial Feature Fusion

FAS-Res2net: An Improved Res2net-Based Script Identification Method for Natural Scenes

GA-YOLO: A Lightweight YOLO Model for Dense and Occluded Grape Target Detection

A Spectral–Spatial Transformer Fusion Method for Hyperspectral Video Tracking

Passenger Flow Detection in Subway Stations Based on Improved You Only Look Once Algorithm

TSBA-YOLO: An Improved Tea Diseases Detection Model Based on Attention Mechanisms and Feature Fusion

Adaptively spatial feature fusion network: an improved UAV detection method for wheat scab

Multimodal motor imagery decoding method based on temporal spatial feature alignment and fusion

Vehicle door frame positioning method for binocular vision robots based on improved YOLOv4

A Semi-Supervised Method for Real-Time Forest Fire Detection Algorithm Based on Adaptively Spatial Feature Fusion

An Infusion Containers Detection Method Based on YOLOv4 with Enhanced Image Feature Fusion

Breast Tumor Classification in Ultrasound Images by Fusion of Deep Convolutional Neural Network and Shallow LBP Feature.

Static Gesture Recognition Algorithm Based on Improved YOLOv5s

Detection of Laodelphax striatellus (small brown planthopper) based on improved YOLOv5

OCT image denoising algorithm based on discrete wavelet transform and spatial domain feature fusion

SFAF-MA: Spatial Feature Aggregation and Fusion With Modality Adaptation for RGB-Thermal Semantic Segmentation

STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams

Spatial Pyramid Pooling and Adaptively Feature Fusion based Yolov3 for Traffic Sign Detection

R-YOLOv5: A Lightweight Rotational Object Detection Algorithm for Real-Time Detection of Vehicles in Dense Scenes

An improved UAV target detection algorithm based on ASFF-YOLOv5s.

Dual-View Spectral and Global Spatial Feature Fusion Network for Hyperspectral Image Classification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Spatial Feature Fusion Research Articles

Related Topics

Articles published on Spatial Feature Fusion

FAS-Res2net: An Improved Res2net-Based Script Identification Method for Natural Scenes

GA-YOLO: A Lightweight YOLO Model for Dense and Occluded Grape Target Detection

A Spectral–Spatial Transformer Fusion Method for Hyperspectral Video Tracking

Passenger Flow Detection in Subway Stations Based on Improved You Only Look Once Algorithm

TSBA-YOLO: An Improved Tea Diseases Detection Model Based on Attention Mechanisms and Feature Fusion

Adaptively spatial feature fusion network: an improved UAV detection method for wheat scab

Multimodal motor imagery decoding method based on temporal spatial feature alignment and fusion

Vehicle door frame positioning method for binocular vision robots based on improved YOLOv4

A Semi-Supervised Method for Real-Time Forest Fire Detection Algorithm Based on Adaptively Spatial Feature Fusion

An Infusion Containers Detection Method Based on YOLOv4 with Enhanced Image Feature Fusion

Breast Tumor Classification in Ultrasound Images by Fusion of Deep Convolutional Neural Network and Shallow LBP Feature.

Static Gesture Recognition Algorithm Based on Improved YOLOv5s

Detection of Laodelphax striatellus (small brown planthopper) based on improved YOLOv5

OCT image denoising algorithm based on discrete wavelet transform and spatial domain feature fusion

SFAF-MA: Spatial Feature Aggregation and Fusion With Modality Adaptation for RGB-Thermal Semantic Segmentation

STFF-SM: Steganalysis Model Based on Spatial and Temporal Feature Fusion for Speech Streams

Spatial Pyramid Pooling and Adaptively Feature Fusion based Yolov3 for Traffic Sign Detection

R-YOLOv5: A Lightweight Rotational Object Detection Algorithm for Real-Time Detection of Vehicles in Dense Scenes

An improved UAV target detection algorithm based on ASFF-YOLOv5s.

Dual-View Spectral and Global Spatial Feature Fusion Network for Hyperspectral Image Classification