Abstract

Causal explanation and empirical prediction are usually addressed separately when modelling ecological systems. This potentially leads to erroneous conflation of model explanatory and predictive power, to predictive models that lack ecological interpretability, or to limited feedback between predictive modelling and theory development. These are fundamental challenges to appropriate statistical and scientific use of ecological models. To help address such challenges, we propose a novel, integrated modelling framework which couples explanatory modelling for causal understanding and input variable selection with a machine learning approach for empirical prediction. Exemplar datasets from the field of freshwater ecology are used to develop and evaluate the framework, based on 267 stream and river monitoring stations across England, UK. These data describe spatial patterns in benthic macroinvertebrate community indices that are hypothesised to be driven by meso-scale physical and chemical habitat conditions. Whilst explanatory models developed using structural equation modelling performed strongly (r2 for two macroinvertebrate indices = 0.64–0.70), predictive models based on extremely randomised trees demonstrated moderate performance (r2 for the same indices = 0.50–0.61). However, through coupling explanatory and predictive components, our proposed framework yields ecologically-interpretable predictive models which also maintain the parsimony and accuracy of models based on solely predictive approaches. This significantly enhances the opportunity for feedback among causal theory, empirical data and prediction within environmental modelling.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call