Abstract

Food security is at stake, with climate change heavily impacting agriculture and food production. In the present context of extreme events and changing conditions, developing advanced crop yield models can learn from all available information and provide interpretable predictions for decision-making is of paramount relevance. This work explores the potential and limitations of developing interpretable crop yield models using long short-term memory (LSTM) neural networks, which typically excel at extracting information from time series. LSTMs were designed and trained with multi-source satellite and meteorological time series over Continental US (CONUS) and corn, soybean, and wheat yield data from the US Department of Agriculture. Two recent attribution methods are used to interpret and extract knowledge from the developed models: integrated gradients (IG), based on back-propagation, and Shapley values (SHAP), based on perturbations. Our results show that (1) LSTM models achieved high accuracy (R <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> > 0.56); (2) multi-source combinations outperformed single variable models in all crop models; (3) both attribution methods were equivalent in detecting essential drivers and their contribution; (4) satellite estimates of enhanced vegetation index (EVI) and vegetation optical depth (VOD) together with meteorological estimates of maximum temperature (TMX) were the most relevant input features for crop yield estimations; and finally, (5) we discovered critical periods of the crop growth cycle for the corn, soybean, and wheat models. The suggested strategy constitutes an important step towards modeling and understanding crop production systems and advancing in automatic data-driven and accountable field management.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call