Abstract

A reliable and accurate forecasting model for crop yields is crucial for effective decision-making across the agricultural sector. Machine learning approaches make it possible to build such predictive models, but the quality of their predictions degrades when data is scarce. In this work, we propose a data-augmentation approach for wheat yield forecasting when only small data sets are available, using data from two distinct provinces in Algeria. We first increased the dimension of each data set by adding more features, and then increased the number of samples by merging the two data sets. To assess the effectiveness of these data-augmentation steps, we conducted three sets of experiments, one for each version of the data: the primary data sets, the data sets with additional features, and the augmented data sets obtained by merging. Each experiment used five regression models: Support Vector Regression (SVR), Random Forest (RF), Extreme Learning Machine (ELM), Artificial Neural Network (ANN), and Deep Neural Network (DNN). The models were evaluated with cross-validation, and the results showed an overall increase in performance with the augmented data. The DNN outperformed the other models for the first province, with a Root Mean Square Error (RMSE) of 0.04 q/ha and an R-squared (R2) of 0.96, whereas the Random Forest outperformed the other models for the second province, with an RMSE of 0.05 q/ha. The data-augmentation approach proposed in this study showed encouraging results.
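
The evaluation described above (merge two small data sets, then score a regressor with cross-validation using RMSE and R2) can be sketched as follows. This is only an illustration under stated assumptions, not the study's implementation: the data are synthetic, "merging" is assumed to mean concatenating the rows of the two provinces, and a one-feature least-squares line stands in for the five regressors named in the abstract. All function names, the single `rain` feature, and the choice of 5 folds are hypothetical.

```python
import math
import random


def rmse(y_true, y_pred):
    """Root Mean Square Error between observed and predicted values."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def fit_line(xs, ys):
    """One-feature ordinary least squares (stand-in for SVR, RF, ELM, ANN, DNN)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def k_fold_scores(rows, k=5, seed=0):
    """Shuffle rows, split into k folds, and pool the held-out predictions."""
    rows = rows[:]
    random.Random(seed).shuffle(rows)
    folds = [rows[i::k] for i in range(k)]
    y_true, y_pred = [], []
    for i, test in enumerate(folds):
        train = [r for j, fold in enumerate(folds) if j != i for r in fold]
        slope, intercept = fit_line([r[0] for r in train], [r[1] for r in train])
        y_true += [r[1] for r in test]
        y_pred += [slope * r[0] + intercept for r in test]
    return rmse(y_true, y_pred), r_squared(y_true, y_pred)


def make_province(n, slope, intercept, seed):
    """Synthetic (rain, yield) rows; NOT the data used in the study."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        rain = rng.uniform(200.0, 600.0)
        rows.append((rain, slope * rain + intercept + rng.gauss(0.0, 1.0)))
    return rows


# Two small synthetic data sets, one per (hypothetical) province.
province_a = make_province(15, 0.020, 5.0, seed=1)
province_b = make_province(15, 0.022, 4.5, seed=2)

# Assumption: merging means stacking the rows of both data sets.
merged = province_a + province_b

for name, rows in [("A only", province_a), ("B only", province_b), ("merged", merged)]:
    score_rmse, score_r2 = k_fold_scores(rows, k=5)
    print(f"{name}: RMSE={score_rmse:.3f}  R2={score_r2:.3f}")
```

Pooling the held-out predictions across folds before computing RMSE and R2 is one common convention; averaging per-fold scores is another, and the abstract does not say which was used.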
