Abstract

Assessing the response of crop yield to year-to-year climate variability at the field scale is often done using process-based models and regression techniques. Although powerful, these tools rely on strong assumptions and can lead to substantial prediction errors. In this study, we investigate the use of a flexible machine learning algorithm combining Functional Principal Component Analysis and Random Forest, to relate field scale wheat yield to local daily climate variables. Instead of computing seasonal, monthly or any other arbitrary time-frame climate averages, climate time series are decomposed by Functional Principal Component Analysis into a few data-driven basis functions, called Principal Curves, in order to summarize the dynamic of key climate variables by a limited number of interpretable components. Scores associated to these components are then used as inputs of a Random Forest algorithm for yield prediction and for analysing important factors responsible for yield variability. To evaluate our approach, we use a French national database including wheat yield data as well as climate and management practice data for 298 farm fields from 2011 to 2016 in four main producing regions. Depending on the regions, our approach can explain from 62% to 81% of the yield variability when both agronomic and climate variables are included, down to 56% to 81% when ignoring agronomic variables and 51% to 74% when ignoring climate variables. Based on a year-by-year cross-validation, RMSE ranges from 0.5 t ha−1 to 2.1 t ha−1 in non-extreme years (2012-2015). However, prediction error can reach 3.6 t ha−1 in case of exceptional weather conditions, such as those experienced in 2016 in Northern France. We find that this new approach performs in most cases better than the same Machine Learning algorithm using the usual time averages of climate variables, without the need to choose an arbitrary time-frame. We then show how important patterns in weather time series can be identified and how their effects on yield can be interpreted using the proposed modelling framework.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call