Abstract
We test several quantitative algorithms as palaeoclimate reconstruction tools for North American and European fossil pollen data, using both classical methods and newer machine-learning approaches based on regression tree ensembles and artificial neural networks. We focus on the reconstruction of secondary climate variables (here, January temperature and annual water balance), as their comparatively small ecological influence compared to the primary variable (July temperature) presents special challenges to palaeo-reconstructions. We test the pollen–climate models using a novel and comprehensive cross-validation approach, running a series of h-block cross-validations using h values of 100–1500 km. Our study illustrates major benefits of this variable h-block cross-validation scheme, as the effect of spatial autocorrelation is minimized, while the cross-validations with increasing h values can reveal instabilities in the calibration model and approximate challenges faced in palaeo-reconstructions with poor modern analogues. We achieve well-performing calibration models for both primary and secondary climate variables, with boosted regression trees providing the overall most robust performance, while the palaeoclimate reconstructions from fossil datasets show major independent features for the primary and secondary variables. Our results suggest that with careful variable selection and consideration of ecological processes, robust reconstruction of both primary and secondary climate variables is possible.
Highlights
IntroductionMicrofossil data (pollen, diatoms, foraminifera, chironomids, testate amoebae, ostracods) are widely employed as proxy indicators of past environmental variations, with applications in palaeoclimatology, environmental monitoring, and the study of ecosystem sensitivity, resilience and anthropogenic impact
Microfossil data are widely employed as proxy indicators of past environmental variations, with applications in palaeoclimatology, environmental monitoring, and the study of ecosystem sensitivity, resilience and anthropogenic impact
We show that the pollen–climate calibration models developed here perform well for both primary and secondary variables
Summary
Microfossil data (pollen, diatoms, foraminifera, chironomids, testate amoebae, ostracods) are widely employed as proxy indicators of past environmental variations, with applications in palaeoclimatology, environmental monitoring, and the study of ecosystem sensitivity, resilience and anthropogenic impact. The tree models give equal weight to rare and abundant taxa, which can help capture the signal of secondary variables in the variation of relatively rare fossil taxa[7,27,28] Beyond these conceptual strengths, an increasing number of recent studies suggest regression tree ensembles can provide well-performing calibration models for microfossil proxies[7,9,26,27,29] and for species distribution models applied to fossil pollen data[30]. This is the first use of two of these ML methods (ETREES, ELM) in palaeoclimate reconstruction
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.