Integrating multiple data sources in species distribution modeling: a framework for data fusion.

Krishna Pacifici,Glenn Stauffer,Brian J Reich,Alexa Mckerrow,Jaime A Collazo,Susheela Singh,Beth Gardner,David A W Miller

doi:10.1002/ecy.1710

Abstract

The last decade has seen a dramatic increase in the use of species distribution models (SDMs) to characterize patterns of species' occurrence and abundance. Efforts to parameterize SDMs often create a tension between the quality and quantity of data available to fit models. Estimation methods that integrate both standardized and non-standardized data types offer a potential solution to the tradeoff between data quality and quantity. Recently several authors have developed approaches for jointly modeling two sources of data (one of high quality and one of lesser quality). We extend their work by allowing for explicit spatial autocorrelation in occurrence and detection error using a Multivariate Conditional Autoregressive (MVCAR) model and develop three models that share information in a less direct manner resulting in more robust performance when the auxiliary data is of lesser quality. We describe these three new approaches ("Shared," "Correlation," "Covariates") for combining data sources and show their use in a case study of the Brown-headed Nuthatch in the Southeastern U.S. and through simulations. All three of the approaches which used the second data source improved out-of-sample predictions relative to a single data source ("Single"). When information in the second data source is of high quality, the Shared model performs the best, but the Correlation and Covariates model also perform well. When the information quality in the second data source is of lesser quality, the Correlation and Covariates model performed better suggesting they are robust alternatives when little is known about auxiliary data collected opportunistically or through citizen scientists. Methods that allow for both data types to be used will maximize the useful information available for estimating species distributions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Ecology	Publication Date: Mar 1, 2017
Citations: 201	License type: CC BY-NC-ND 4.0

R Discovery Prime

R Discovery Prime

Integrating multiple data sources in species distribution modeling: a framework for data fusion.

Abstract

Talk to us

Similar Papers

More From: Ecology

Lead the way for us

Similar Papers

A Sparse Areal Mixed Model for Multivariate Outcomes, with an Application to Zero-Inflated Census Data
Donald Musgrove ... John Hughes
-
Donald Musgrove, et. al.Donald Musgrove ... John Hughes
01 Jan 2019
01 Jan 2019

Monitoring protected areas from space: A multi-temporalassessment using raptors as biodiversity surrogates.
Adrián Regos ... Alberto Gil-Carrera
PLOS ONE | VOL. 12
Adrián Regos, et. al.Adrián Regos ... Alberto Gil-Carrera
24 Jul 2017
PLOS ONE | VOL. 12

Spatio-temporal model for crop yield forecasting
Panudet Saengseedam ... Nantachai Kantanantha
Journal of Applied Statistics | VOL. 44
Panudet Saengseedam, et. al.Panudet Saengseedam ... Nantachai Kantanantha
21 Apr 2016
Journal of Applied Statistics | VOL. 44

The effectiveness of species distribution models in predicting local abundance depends on model grain size.
Mattia Brambilla ... Luca Ilahiane
Ecology | VOL. 105
Mattia Brambilla, et. al.Mattia Brambilla ... Luca Ilahiane
21 Dec 2023
Ecology | VOL. 105

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrating multiple data sources in species distribution modeling: a framework for data fusion.

Abstract

Talk to us

Similar Papers

More From: Ecology