Abstract
Accurately recognizing apples in complex environments is essential for automating apple-picking operations, particularly under challenging natural conditions such as cloudy, snowy, foggy, and rainy weather, as well as in low-light situations. To counter the drop in detection accuracy caused by branch occlusion, apple overlap, and scale variation between near-field and far-field targets, we propose Rep-ViG-Apple, an improved algorithm built on the YOLO model. To strengthen feature extraction for occluded and overlapping apple targets, we design an inverted residual multi-scale structurally reparameterized feature extraction block (RepIRD Block) in the backbone network, and we integrate the sparse vision graph attention (SVGA) mechanism to capture global feature information, concentrate attention on the apples, and suppress interference from complex background features. Building on these components, we construct a CNN-GCN feature extraction network, termed Rep-Vision-GCN, which combines the local multi-scale feature extraction of a convolutional neural network (CNN) with the global modeling capability of a graph convolutional network (GCN) to enhance apple feature extraction. In the neck network, the embedded RepConvsBlock module forms the Rep-FPN-PAN feature fusion network, which improves the recognition of apple targets across near and far scales. Finally, we apply a channel pruning algorithm based on LAMP scores to balance computational efficiency against model accuracy. Experimental results show that Rep-ViG-Apple achieves a precision of 92.5%, a recall of 85.0%, and an average accuracy of 93.3%, improvements of 1.5%, 1.5%, and 2.0% over YOLOv8n, while reducing model size by 22%, making it efficient and well suited to deployment in resource-constrained environments without sacrificing accuracy.
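The "Rep" components above rely on structural reparameterization: a multi-branch block used during training is algebraically collapsed into a single convolution at inference, so the deployed model keeps the representational benefit of the multi-branch design without its runtime cost. The abstract gives no code, so the following is only a minimal PyTorch sketch of that general idea, fusing a parallel 3x3 + 1x1 convolution pair into one equivalent 3x3 convolution; the function name fuse_branches and the two-branch layout are illustrative assumptions, not the paper's actual RepIRD Block or RepConvsBlock implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def fuse_branches(conv3x3: nn.Conv2d, conv1x1: nn.Conv2d) -> nn.Conv2d:
    """Fuse parallel 3x3 and 1x1 branches (same channels, stride 1,
    both with bias) into one equivalent 3x3 convolution.

    Illustrative sketch only; not the paper's RepIRD Block."""
    fused = nn.Conv2d(conv3x3.in_channels, conv3x3.out_channels,
                      kernel_size=3, padding=1, bias=True)
    # Zero-pad the 1x1 kernel to 3x3 so the two kernels can be summed.
    padded = F.pad(conv1x1.weight.detach(), [1, 1, 1, 1])
    fused.weight.data.copy_(conv3x3.weight.detach() + padded)
    fused.bias.data.copy_(conv3x3.bias.detach() + conv1x1.bias.detach())
    return fused

# Sanity check: the fused single-branch conv reproduces the two-branch sum.
x = torch.randn(1, 8, 32, 32)
b3 = nn.Conv2d(8, 8, 3, padding=1)
b1 = nn.Conv2d(8, 8, 1)
assert torch.allclose(b3(x) + b1(x), fuse_branches(b3, b1)(x), atol=1e-5)
```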
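The pruning step uses LAMP (Layer-Adaptive Magnitude-based Pruning) scores, which normalize each weight's squared magnitude by the total squared magnitude of all weights in the same layer that are at least as large, so that pruning thresholds become comparable across layers. The sketch below shows that scoring rule under stated assumptions: the helper name lamp_scores and the 22% quantile threshold are hypothetical, and it builds an unstructured weight mask for simplicity, whereas the paper applies the scores to channel pruning.

```python
import torch

def lamp_scores(weight: torch.Tensor) -> torch.Tensor:
    """LAMP score per weight: w^2 divided by the sum of v^2 over all
    weights v in the same tensor with v^2 >= w^2 (itself included)."""
    w2 = weight.detach().flatten().pow(2)
    order = torch.argsort(w2)  # ascending by squared magnitude
    sorted_w2 = w2[order]
    # Reverse cumulative sum: for each weight, the total squared magnitude
    # of every weight at least as large as it.
    tail_sums = torch.flip(torch.cumsum(torch.flip(sorted_w2, [0]), 0), [0])
    scores = torch.zeros_like(w2)
    scores[order] = sorted_w2 / tail_sums
    return scores.view_as(weight)

# Illustrative use: mask out the weights with the lowest LAMP scores.
w = torch.randn(16, 8, 3, 3)
s = lamp_scores(w)
threshold = torch.quantile(s.flatten(), 0.22)  # hypothetical 22% sparsity
mask = (s >= threshold).float()
```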