Abstract

Winter wheat (Triticum aestivum L.) is one of the most important cereal crops, supplying essential food for the world population. Because the United States is a major producer and exporter of wheat to the world market, accurate and timely forecasting of wheat yield in the United States (U.S.) is fundamental to national crop management as well as global food security. Previous studies mainly have focused on developing empirical models using only satellite remote sensing images, while other yield determinants have not yet been adequately explored. In addition, these models are based on traditional statistical regression algorithms, while more advanced machine learning approaches have not been explored. This study used advanced machine learning algorithms to establish within-season yield prediction models for winter wheat using multi-source data to address these issues. Specifically, yield driving factors were extracted from four different data sources, including satellite images, climate data, soil maps, and historical yield records. Subsequently, two linear regression methods, including ordinary least square (OLS) and least absolute shrinkage and selection operator (LASSO), and four well-known machine learning methods, including support vector machine (SVM), random forest (RF), Adaptive Boosting (AdaBoost), and deep neural network (DNN), were applied and compared for estimating the county-level winter wheat yield in the Conterminous United States (CONUS) within the growing season. Our models were trained on data from 2008 to 2016 and evaluated on data from 2017 and 2018, with the results demonstrating that the machine learning approaches performed better than the linear regression models, with the best performance being achieved using the AdaBoost model (R2 = 0.86, RMSE = 0.51 t/ha, MAE = 0.39 t/ha). Additionally, the results showed that combining data from multiple sources outperformed single source satellite data, with the highest accuracy being obtained when the four data sources were all considered in the model development. Finally, the prediction accuracy was also evaluated against timeliness within the growing season, with reliable predictions (R2 > 0.84) being able to be achieved 2.5 months before the harvest when the multi-source data were combined.

Highlights

  • As one of the top cereals that supply world food, wheat is a rich source of calories and it is an essential staple in most regions of the world [1]

  • The goal of the study was to answer the following three questions: (1) Which is the best model for predicting Conterminous United States (CONUS) winter wheat yield? (2) How much improvement can be obtained by combing the multi-source data? (3) What is the near real-time prediction performance within the growing season?

  • We developed machine learning models to predict county-level winter wheat yield for the CONUS, by using multi-source data

Read more

Summary

Introduction

As one of the top cereals that supply world food, wheat is a rich source of calories and it is an essential staple in most regions of the world [1]. In 2017, the global cereal crop product was 2.98 billion tons, of which 780 million tons were wheat [2]. Despite the tremendous wheat product grown annually, the total demand is still difficult to meet, leading to a recent increase in food prices [3,4]. The United States is one of the leading wheat producers in the world, with an annual production of over 51 million tons in 2018 [5]. Winter wheat (Triticum aestivum L.), which is planted in the preceding fall, is a primary variety, representing more than 70% of the total U.S production [6]. Due to its large scale, timely and accurate forecasts of the winter wheat yield in the United States (U.S.) are of great significance for regional and global food security

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.