Convolutional Block Research Articles

In improving agricultural yields and ensuring food security, precise detection of maize leaf diseases is of great importance. Traditional disease detection methods show limited performance in complex environments, making it challenging to meet the demands for precise detection in modern agriculture. This paper proposes a maize leaf disease detection model based on a state-space attention mechanism, aiming to effectively utilize the spatiotemporal characteristics of maize leaf diseases to achieve efficient and accurate detection. The model introduces a state-space attention mechanism combined with a multi-scale feature fusion module to capture the spatial distribution and dynamic development of maize diseases. In experimental comparisons, the proposed model demonstrates superior performance in the task of maize disease detection, achieving a precision, recall, accuracy, and F1 score of 0.94. Compared with baseline models such as AlexNet, GoogLeNet, ResNet, EfficientNet, and ViT, the proposed method achieves a precision of 0.95, with the other metrics also reaching 0.94, showing significant improvement. Additionally, ablation experiments verify the impact of different attention mechanisms and loss functions on model performance. The standard self-attention model achieved a precision, recall, accuracy, and F1 score of 0.74, 0.70, 0.72, and 0.72, respectively. The Convolutional Block Attention Module (CBAM) showed a precision of 0.87, recall of 0.83, accuracy of 0.85, and F1 score of 0.85, while the state-space attention module achieved a precision of 0.95, with the other metrics also at 0.94. In terms of loss functions, cross-entropy loss showed a precision, recall, accuracy, and F1 score of 0.69, 0.65, 0.67, and 0.67, respectively. Focal loss showed a precision of 0.83, recall of 0.80, accuracy of 0.81, and F1 score of 0.81. State-space loss demonstrated the best performance in these experiments, achieving a precision of 0.95, with recall, accuracy, and F1 score all at 0.94. These results indicate that the model based on the state-space attention mechanism achieves higher detection accuracy and better generalization ability in the task of maize leaf disease detection, effectively improving the accuracy and efficiency of disease recognition and providing strong technical support for the early diagnosis and management of maize diseases. Future work will focus on further optimizing the model’s spatiotemporal feature modeling capabilities and exploring multi-modal data fusion to enhance the model’s application in real agricultural scenarios.

Read full abstract

In the realm of autonomous driving, practical driving scenarios are fraught with numerous complexities, including inclement weather conditions, nighttime blurriness, and ambient light sources that significantly hinder drivers’ ability to discern road indicators. Furthermore, the dynamic nature of road indicators, which are constantly evolving, poses additional challenges for computer vision-based detection systems. To address these issues, this paper introduces a road indicator light detection model, leveraging the advanced capabilities of YOLOv8. We have ingeniously integrated the robust backbone of YOLOv8 with four distinct attention mechanism modules—Convolutional Block Attention Module (CBAM), Efficient Channel Attention (ECA), Shuffle Attention (SA), and Global Attention Mechanism (GAM)—to significantly enhance the model performance in capturing nuanced features of road indicators and boosting the accuracy of detecting minute objects. Additionally, we employ the Asymptotic Feature Pyramid Network (AFPN) strategy, which optimizes the fusion of features across different scales, ensuring not only an enhanced performance but also maintaining real-time capability. These innovative attention modules empower the model by recalibrating the significance of both channel and spatial dimensions within the feature maps, enabling it to hone in on the most pertinent object characteristics. To tackle the challenges posed by samples rich in small, occluded, background-similar objects, and those that are inherently difficult to recognize, we have incorporated the Focaler-IOU loss function. This loss function deftly reduces the contribution of easily detectable samples to the overall loss, thereby intensifying the focus on challenging samples. This strategic balancing of hard-to-detect versus easy-to-detect samples effectively elevates the model’s detection performance. Experimental evaluations conducted on both a public traffic signal dataset and a proprietary headlight dataset have yielded impressive results, with both mAP50 and mAP50:95 metrics experiencing significant improvements exceeding two percentage points. Notably, the enhancements observed in the headlight dataset are particularly profound, signifying a significant step forward toward realizing safer and more reliable assisted driving technologies.

Read full abstract

Convolutional Block Research Articles

Related Topics

Articles published on Convolutional Block

Reduction of Vision-Based Models for Fall Detection

RJ-TinyViT: an efficient vision transformer for red jujube defect classification.

A Hybrid Deep Learning Model for Enhanced Structural Damage Detection: Integrating ResNet50, GoogLeNet, and Attention Mechanisms

KOC_Net: Impact of the Synthetic Minority Over-Sampling Technique with Deep Learning Models for Classification of Knee Osteoarthritis Using Kellgren–Lawrence X-Ray Grade

A Lightweight Barcode Detection Algorithm Based on Deep Learning

Cucumber Leaf Segmentation Based on Bilayer Convolutional Network

Improved Prototypical Network Model for Classification of Farmland Shelterbelt Using Sentinel-2 Imagery

A Deep Learning Model for Accurate Maize Disease Detection Based on State-Space Attention and Feature Fusion

A Novel Defect Segmentation Model via Integrating Enhanced U-Net with Convolutional Block Attention Mechanisms

FusionGCN: Multi-focus image fusion using superpixel features generation GCN and pixel-level feature reconstruction CNN

DeepLigType: Predicting Ligand Types of Protein-Ligand Binding Sites Using a Deep Learning Model.

BreakNet: discontinuity-resilient multi-scale transformer segmentation of retinal layers

Combination of edge enhancement and cold diffusion model for low dose CT image denoising.

A Lightweight and Efficient Multimodal Feature Fusion Network for Bearing Fault Diagnosis in Industrial Applications

Enhancing the Rainfall Forecasting Accuracy of Ensemble Numerical Prediction Systems via Convolutional Neural Networks

Quantitative characterization of surface defects on bridge cable based on improved YOLACT++

Feature Enhancement Based Oriented Object Detection in Remote Sensing Images

A method for enhancing near-mirror object detection by integrating AWCS and CBAM into ResNet18

Adaptation of Object Detection Algorithms for Road Indicator Lights in Complex Scenes

FINE VESSEL SEGMENTATION WITH REFINEMENT GATE IN DEEP LEARNING ARCHITECTURES

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Convolutional Block Research Articles

Related Topics

Articles published on Convolutional Block

Reduction of Vision-Based Models for Fall Detection

RJ-TinyViT: an efficient vision transformer for red jujube defect classification.

A Hybrid Deep Learning Model for Enhanced Structural Damage Detection: Integrating ResNet50, GoogLeNet, and Attention Mechanisms

KOC_Net: Impact of the Synthetic Minority Over-Sampling Technique with Deep Learning Models for Classification of Knee Osteoarthritis Using Kellgren–Lawrence X-Ray Grade

A Lightweight Barcode Detection Algorithm Based on Deep Learning

Cucumber Leaf Segmentation Based on Bilayer Convolutional Network

Improved Prototypical Network Model for Classification of Farmland Shelterbelt Using Sentinel-2 Imagery

A Deep Learning Model for Accurate Maize Disease Detection Based on State-Space Attention and Feature Fusion

A Novel Defect Segmentation Model via Integrating Enhanced U-Net with Convolutional Block Attention Mechanisms

FusionGCN: Multi-focus image fusion using superpixel features generation GCN and pixel-level feature reconstruction CNN

DeepLigType: Predicting Ligand Types of Protein-Ligand Binding Sites Using a Deep Learning Model.

BreakNet: discontinuity-resilient multi-scale transformer segmentation of retinal layers

Combination of edge enhancement and cold diffusion model for low dose CT image denoising.

A Lightweight and Efficient Multimodal Feature Fusion Network for Bearing Fault Diagnosis in Industrial Applications

Enhancing the Rainfall Forecasting Accuracy of Ensemble Numerical Prediction Systems via Convolutional Neural Networks

Quantitative characterization of surface defects on bridge cable based on improved YOLACT++

Feature Enhancement Based Oriented Object Detection in Remote Sensing Images

A method for enhancing near-mirror object detection by integrating AWCS and CBAM into ResNet18

Adaptation of Object Detection Algorithms for Road Indicator Lights in Complex Scenes

FINE VESSEL SEGMENTATION WITH REFINEMENT GATE IN DEEP LEARNING ARCHITECTURES