Abstract
The outbreak of cassava diseases poses a serious threat to agricultural economic security and food production systems in tropical regions. Traditional manual monitoring methods are limited by efficiency bottlenecks and insufficient spatial coverage. Although low-altitude drone technology offers advantages such as high resolution and strong timeliness, it faces dual challenges in the field of disease identification, such as complex background interference and irregular disease morphology. To address these issues, this study proposes an intelligent classification method for cassava diseases based on drone imagery and an ED-Swin Transformer. Firstly, we introduced the EMAGE (Efficient Multi-Scale Attention with Grouping and Expansion) module, which integrates the global distribution features and local texture details of diseased leaves in drone imagery through a multi-scale grouped attention mechanism, effectively mitigating the interference of complex background noise on feature extraction. Secondly, the DASPP (Deformable Atrous Spatial Pyramid Pooling) module was designed to use deformable atrous convolution to adaptively match the irregular boundaries of diseased areas, enhancing the model's robustness to morphological variations caused by angles and occlusions in low-altitude drone photography. The results show that the ED-Swin Transformer model achieved excellent performance across five evaluation metrics, with scores of 94.32%, 94.56%, 98.56%, 89.22%, and 96.52%, representing improvements of 1.28%, 2.32%, 0.38%, 3.12%, and 1.4%, respectively. These experiments demonstrate the superior performance of the ED-Swin Transformer model in cassava classification networks.
Published Version
Join us for a 30 min session where you can share your feedback and ask us any queries you have