Forecasting Corn Yield With Machine Learning Ensembles.

Mohsen Shahhosseini,Guiping Hu,Sotirios V Archontoulis

doi:10.3389/fpls.2020.01120

Mohsen Shahhosseini, Guiping Hu + Show 1 more

Open Access

https://doi.org/10.3389/fpls.2020.01120

Copy DOI

Journal: Frontiers in Plant Science	Publication Date: Jul 31, 2020
Citations: 125	License type: CC BY 4.0

Affiliation: Iowa State University

Abstract

The emergence of new technologies to synthesize and analyze big data with high-performance computing has increased our capacity to more accurately predict crop yields. Recent research has shown that machine learning (ML) can provide reasonable predictions faster and with higher flexibility compared to simulation crop modeling. However, a single machine learning model can be outperformed by a “committee” of models (machine learning ensembles) that can reduce prediction bias, variance, or both and is able to better capture the underlying distribution of the data. Yet, there are many aspects to be investigated with regard to prediction accuracy, time of the prediction, and scale. The earlier the prediction during the growing season the better, but this has not been thoroughly investigated as previous studies considered all data available to predict yields. This paper provides a machine leaning based framework to forecast corn yields in three US Corn Belt states (Illinois, Indiana, and Iowa) considering complete and partial in-season weather knowledge. Several ensemble models are designed using blocked sequential procedure to generate out-of-bag predictions. The forecasts are made in county-level scale and aggregated for agricultural district and state level scales. Results show that the proposed optimized weighted ensemble and the average ensemble are the most precise models with RRMSE of 9.5%. Stacked LASSO makes the least biased predictions (MBE of 53 kg/ha), while other ensemble models also outperformed the base learners in terms of bias. On the contrary, although random k-fold cross-validation is replaced by blocked sequential procedure, it is shown that stacked ensembles perform not as good as weighted ensemble models for time series data sets as they require the data to be non-IID to perform favorably. Comparing our proposed model forecasts with the literature demonstrates the acceptable performance of forecasts made by our proposed ensemble model. Results from the scenario of having partial in-season weather knowledge reveals that decent yield forecasts with RRMSE of 9.2% can be made as early as June 1st. Moreover, it was shown that the proposed model performed better than individual models and benchmark ensembles at agricultural district and state-level scales as well as county-level scale. To find the marginal effect of each input feature on the forecasts made by the proposed ensemble model, a methodology is suggested that is the basis for finding feature importance for the ensemble model. The findings suggest that weather features corresponding to weather in weeks 18–24 (May 1st to June 1st) are the most important input features.

Highlights

Providing 11% of total US employment, agriculture and its related industries are considered as a significant contributor to the US economy, with $1.053 trillion of US gross domestic product (GDP) in 2017 (USDA Economic Research Center, 2019)
After presenting the numerical results of designed forecasting machine learning (ML) models and comparing them with the literature, this section discusses the effect of in-season weather information on the quality of forecasts by comparing the prediction accuracy of designed ensemble models on different subsets of in-season weather information
The prediction results of the scenario of having partial in-season weather demonstrated that ample corn yield forecasts can be made as early as June 1st

Summary

Introduction

Providing 11% of total US employment, agriculture and its related industries are considered as a significant contributor to the US economy, with $1.053 trillion of US gross domestic product (GDP) in 2017 (USDA Economic Research Center, 2019). Crop yield prediction is of high significance since it can provide insights and information for improving crop management, economic trading, food production monitoring, and global food security. The emergence of new technologies, such as simulation crop models and machine learning in the recent years, and the ability to analyze big data with high-performance computing has resulted in more accurate yield predictions (Drummond et al, 2003; Vincenzi et al, 2011; González Sánchez et al, 2014; Jeong et al, 2016; Pantazi et al, 2016; Cai et al, 2017; Chlingaryan et al, 2018; Crane-Droesch, 2018; Basso and Liu, 2019; Shahhosseini et al, 2019c). We do not know if predictions are more accurate at a finer (county) or course (agricultural district) scale. Previous research by Sakamoto et al (2014) and Peng et al (2018) suggested better prediction accuracy for course scale compared to a finer scale

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Forecasting Corn Yield With Machine Learning Ensembles.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Plant Science

Lead the way for us

Similar Papers

County-scale crop yield prediction by integrating crop simulation with machine learning models.
Saiara Samira Sajid ... Sotirios V Archontoulis
Frontiers in Plant Science | VOL. 13
Saiara Samira Sajid, et. al.Saiara Samira Sajid ... Sotirios V Archontoulis
28 Nov 2022
Frontiers in Plant Science | VOL. 13

Coupling machine learning and crop modeling improves crop yield prediction in the US Corn Belt
Mohsen Shahhosseini ... Isaiah Huber
Scientific Reports | VOL. 11
Mohsen Shahhosseini, et. al.Mohsen Shahhosseini ... Isaiah Huber
15 Jan 2021
Scientific Reports | VOL. 11

Improving early crop yield and price predictions using satellite imagery with machine and deep learning techniques
Florian Teste ... Philippe Ciais
-
Florian Teste, et. al.Florian Teste ... Philippe Ciais
18 Mar 2024
18 Mar 2024

Integrating Crop Simulation and Machine Learning Models to Improve Crop Yield Prediction
Saiara Samira Sajid ... Guiping Hu
-
Saiara Samira Sajid, et. al.Saiara Samira Sajid ... Guiping Hu
07 Jun 2022
07 Jun 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Forecasting Corn Yield With Machine Learning Ensembles.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Plant Science