Innovative agricultural solutions are needed to detect and classify leaf diseases early across crop species and environments. This study compares deep learning approaches, focusing on Convolutional Neural Networks (CNN) and Vision Transformers (VTs), to identify leaf diseases early and accurately for scalable crop management and productivity. Optimizing CNNs, Explainable Transfer Learning (ETPLDNet) using ResNet50 architecture, and LEViT leaf disease diagnosis are compared. The CNN model, optimized with dynamic hyperparameters, achieved an impressive 99.58% accuracy for leaf disease classification, demonstrating its effectiveness in feature extraction and classification precision. On the other hand, the VT-based LEViT model, which leverages self-attention mechanisms and Explainable AI (XAI), achieved 95.22% accuracy but offers enhanced interpretability and generalization capabilities due to its transformer-based architecture. This distinction illustrates that while CNNs excel in accuracy, VTs provide a more transparent decision-making process and better handle the complex variances in plant leaf diseases, making them ideal for precision agriculture. The combined use of CNNs and VTs showcases the strengths of each model, with CNN focusing on high classification precision and VTs offering improved interpretability and adaptability for various leaf disease conditions. The use of XAI enables the models to highlight important areas in plant leaf images that influence the model's decisions, offering a transparent and interpretable decision-making process that allows researchers and farmers to understand why a particular diagnosis or classification was made. This ability to visualize and explain the reasoning behind the model predictions is crucial to increasing trust in AI-driven solutions in agriculture. By combining the high precision of CNN and the interpretability of VT with XAI, this study offers a robust approach to improving crop disease management and precision agriculture.
Read full abstract