Achieving timely and non-destructive assessments of crop yields is a key challenge in the agricultural field, as it is important for optimizing field management measures and improving crop productivity. To accurately and quickly predict citrus yield, this study obtained multispectral images of citrus fruit maturity through an unmanned aerial vehicle (UAV) and extracted multispectral vegetation indices (VIs) and texture features (T) from the images as feature variables. Extreme gradient boosting (XGB), random forest (RF), support vector machine (SVM), gaussian process regression (GPR), and multiple stepwise regression (MSR) models were used to construct citrus fruit number and quality prediction models. The results show that, for fruit number prediction, the XGB model performed best under the combined input of VIs and T, with an R2 = 0.792 and an RMSE = 462 fruits. However, for fruit quality prediction, the RF model performed best when only the VIs were used, with an R2 = 0.787 and an RMSE = 20.0 kg. Although the model accuracy was acceptable, the number of input feature variables used was large. To further improve the model prediction performance, we explored a method that utilizes a hybrid coding particle swarm optimization algorithm (CPSO) coupled with XGB and SVM models. The coupled models had a significant improvement in predicting the number and quality of citrus fruits, especially the model of CPSO coupled with XGB (CPSO-XGB). The CPSO-XGB model had fewer input features and higher accuracy, with an R2 > 0.85. Finally, the Shapley additive explanations (SHAP) method was used to reveal the importance of the normalized difference chlorophyll index (NDCI) and the red band mean feature (MEA_R) when constructing the prediction model. The results of this study provide an application reference and a theoretical basis for the research on UAV remote sensing in relation to citrus yield.
Read full abstract