Abstract

AbstractAimIntegrated species distribution modelling has emerged as a useful tool for ecologists to exploit the range of information available on where species occur. In particular, the ability to combine large numbers of ad hoc or presence‐only (PO) records with more structured presence–absence (PA) data can allow ecologists to account for biases in PO data which often confound modelling efforts. A range of modelling techniques have been suggested to implement integrated species distribution models (IDMs) including joint likelihood models, including one dataset as a covariate or informative prior, and fitting a correlation structure between datasets. We aim to investigate the performance of different types of integrated models under realistic ecological data scenarios.InnovationWe use a virtual ecologist approach to investigate which integrated model is most advantageous under varying levels of spatial bias in PO data, sample size of PA data and spatial overlap between datasets.Main conclusionsJoint likelihood models were the best performing models when spatial bias in PO data was low, or could be modelled, but gave poor estimates when there were unknown biases in the data. Correlation models provided good model estimates even when there were unknown biases and when good quality PA data were spatially limited. Including PO data via an informative prior provided little improvement over modelling PA data alone and was inferior to using either the joint likelihood or correlation approach. Our results suggest that correlation models provide a robust alternative to joint likelihood models when covariates related to effort or detection in PO data are not available. Ecologists should be aware of the limitations of each approach and consider how well biases in the data can be modelled when deciding which type of IDM to use.

Highlights

  • Integrated species distribution models (IDMs), which combine multiple data sources to model species distributions, are becoming increasingly common (Isaac et al, 2020)

  • This study set out to investigate the performance of three integrated species distribution models under a range of scenarios

  • The simulation study demonstrated that the joint model did not always perform better than the single PA model unlike the study by Pacifici et al (2017) whose integrated models always produced better predictions than its single model when the underlying assumption that the two data sources were related was met

Read more

Summary

BIODIVERSITY METHODS

Integrated species distribution models: A comparison of approaches under different data quality scenarios. Present address Siti Sarah Ahmad Suhaimi, PPG Coatings (Malaysia), UOA Business Park, Shah Alam, Malaysia. Funding information Engineering and Physical Sciences Research Council, Grant/Award Number: EP/ P002285/1; Natural Environment Research Council, Grant/Award Number: NE/ R016429/1

| INTRODUCTION
| METHODS
| DISCUSSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.