Falls are closely associated with the high mortality rate of the elderly, so fall detection has become an important and urgent research area in human behavior recognition. However, existing fall detection methods lose detailed action information during feature extraction because of downsampling operations, resulting in poor performance when distinguishing falls from similar behaviors such as lying down and sitting. To address these challenges, this study proposes a high-resolution spatio-temporal feature extraction method based on a spatio-temporal coordinate attention mechanism. The method employs 3D convolutions to extract spatio-temporal features and uses gradual downsampling to generate multi-resolution sub-networks, thereby realizing multi-scale fusion and enhanced perception of details. In particular, this study designs a pseudo-3D basic block that approximates the capability of 3D convolution, maintaining the running speed of the network while controlling the number of parameters. Furthermore, a spatio-temporal coordinate attention mechanism is designed to accurately extract the spatio-temporal positional changes of key skeletal points and the interrelationships among them. Long-range dependencies in the horizontal, vertical, and temporal directions are captured through three one-dimensional global pooling operations. The long-range relationships and channel correlations among features are then captured by concatenation and slicing operations. Finally, key information is effectively highlighted by element-wise multiplication between the attention maps from the horizontal, vertical, and temporal directions and the input feature map. Experimental results on three typical public datasets show that the proposed method extracts motion features more effectively and improves the accuracy of fall detection.
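The spatio-temporal coordinate attention described above can be illustrated with a minimal PyTorch-style sketch. It assumes a 3D feature layout of shape (N, C, T, H, W); the module name, reduction ratio, and the use of a shared 1x1x1 convolution with sigmoid gating are illustrative assumptions rather than the paper's exact design.

```python
import torch
import torch.nn as nn

class STCoordAttention(nn.Module):
    """Sketch of a spatio-temporal coordinate attention block.

    Input and output shape: (N, C, T, H, W). Hypothetical implementation
    following the abstract: three 1-D global poolings, concatenation,
    slicing, and element-wise gating of the input features.
    """
    def __init__(self, channels, reduction=32):
        super().__init__()
        mid = max(8, channels // reduction)
        # Shared 1x1x1 conv over the concatenated directional descriptors.
        self.conv1 = nn.Conv3d(channels, mid, kernel_size=1)
        self.bn1 = nn.BatchNorm3d(mid)
        self.act = nn.ReLU(inplace=True)
        # Separate 1x1x1 convs produce one attention map per direction.
        self.conv_t = nn.Conv3d(mid, channels, kernel_size=1)
        self.conv_h = nn.Conv3d(mid, channels, kernel_size=1)
        self.conv_w = nn.Conv3d(mid, channels, kernel_size=1)

    def forward(self, x):
        n, c, t, h, w = x.shape
        # Three 1-D global poolings: keep one axis, average over the other two.
        x_t = x.mean(dim=(3, 4), keepdim=True)                          # (N, C, T, 1, 1)
        x_h = x.mean(dim=(2, 4), keepdim=True).permute(0, 1, 3, 2, 4)   # (N, C, H, 1, 1)
        x_w = x.mean(dim=(2, 3), keepdim=True).permute(0, 1, 4, 3, 2)   # (N, C, W, 1, 1)
        # Concatenate along the pooled axis and encode jointly, capturing
        # long-range relationships and channel correlations.
        y = torch.cat([x_t, x_h, x_w], dim=2)                           # (N, C, T+H+W, 1, 1)
        y = self.act(self.bn1(self.conv1(y)))
        # Slice back into the three directional components.
        y_t, y_h, y_w = torch.split(y, [t, h, w], dim=2)
        a_t = torch.sigmoid(self.conv_t(y_t))                                 # (N, C, T, 1, 1)
        a_h = torch.sigmoid(self.conv_h(y_h.permute(0, 1, 3, 2, 4)))          # (N, C, 1, H, 1)
        a_w = torch.sigmoid(self.conv_w(y_w.permute(0, 1, 4, 3, 2)))          # (N, C, 1, 1, W)
        # Element-wise multiplication with the input highlights key positions.
        return x * a_t * a_h * a_w

# Usage sketch: a clip-level feature map of 64 channels over 16 frames.
if __name__ == "__main__":
    features = torch.randn(2, 64, 16, 32, 32)
    attention = STCoordAttention(64)
    out = attention(features)
    print(out.shape)  # torch.Size([2, 64, 16, 32, 32])
```

In this reading, each attention map attends to positions along only one axis, so multiplying all three back onto the input lets the block emphasize where and when key skeletal points move without discarding spatial or temporal resolution.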