Sampbias, a method for quantifying geographic sampling biases in species distribution data

Alexander Zizka,Daniele Silvestro,Alexandre Antonelli

doi:10.1111/ecog.05102

Abstract

Geo‐referenced species occurrences from public databases have become essential to biodiversity research and conservation. However, geographical biases are widely recognized as a factor limiting the usefulness of such data for understanding species diversity and distribution. In particular, differences in sampling intensity across a landscape due to differences in human accessibility are ubiquitous but may differ in strength among taxonomic groups and data sets. Although several factors have been described to influence human access (such as presence of roads, rivers, airports and cities), quantifying their specific and combined effects on recorded occurrence data remains challenging. Here we present sampbias, an algorithm and software for quantifying the effect of accessibility biases in species occurrence data sets. sampbias uses a Bayesian approach to estimate how sampling rates vary as a function of proximity to one or multiple bias factors. The results are comparable among bias factors and data sets. We demonstrate the use of sampbias on a data set of mammal occurrences from the island of Borneo, showing a high biasing effect of cities and a moderate effect of roads and airports. sampbias is implemented as a well‐documented, open‐access and user‐friendly R package that we hope will become a standard tool for anyone working with species occurrences in ecology, evolution, conservation and related fields.

Highlights

Available data sets of geo-referenced species occurrences, such as provided by the Global Biodiversity Information Facility () have become a fundamental resource in biological sciences, especially in biogeography, conservation and macroecology
Sampling biases that may affect the recording of species occurrences include the under-sampling of specific taxa (‘taxonomic bias’, e.g. birds versus nematodes), specific geographic regions (‘geographic bias’, e.g. accessible versus remote areas) and specific temporal periods (‘temporal bias’, e.g. wet versus dry season)
We present sampbias ver. 1.0.4, a probabilistic method to quantify accessibility bias in data sets of species occurrences. sampbias is implemented as a user-friendly R-package and uses a Bayesian approach to address three questions: 1) How strong is the accessibility bias in a given data set? 2) How strong is the effect of different bias factors in causing the overall accessibility bias? 3) How is accessibility bias distributed in space?

Summary

Background

Available data sets of geo-referenced species occurrences, such as provided by the Global Biodiversity Information Facility () have become a fundamental resource in biological sciences, especially in biogeography, conservation and macroecology These data sets are typically not collected systematically and rarely include information on collection effort. Physical accessibility by people is omnipresent as a bias factor (Kadmon et al 2004, Engemann et al 2015, Lin et al 2015), across spatial scales, as the commonly used term ‘roadside bias’ testifies This means that most species observations are made in or near cities, along roads, paths, rivers and near human settlements. It is crucial that researchers realise the intrinsic biases associated with the data they deal with, especially in cross-taxonomic studies, since occurrence data sets from different taxa are likely differently affected by sampling biases due to differences in specimen collection and transportation. The results may be used to identify priorities for further collection or digitalization efforts and to assess the reliability of scientific results based on publicly available species distribution data

General concept

Quantifying accessibility bias using a Bayesian framework

Bö å b

Example and empirical validation

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Ecography	Publication Date: Oct 8, 2020
Citations: 80	License type: CC BY 3.0

R Discovery Prime

Sampbias, a method for quantifying geographic sampling biases in species distribution data

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Ecography

Lead the way for us

Similar Papers

Predicting invasive alien plant distributions: how geographical bias in occurrence records influences model performance
René Wolmarans ... Mark P Robertson
Journal of Biogeography | VOL. 37
René Wolmarans, et. al.René Wolmarans ... Mark P Robertson
16 Aug 2010
Journal of Biogeography | VOL. 37

Geographic selection bias of occurrence data influences transferability of invasive Hydrilla verticillata distribution models.
Matthew A Barnes ... W Lindsay Chadderton
Ecology and Evolution | VOL. 4
Matthew A Barnes, et. al.Matthew A Barnes ... W Lindsay Chadderton
26 May 2014
Ecology and Evolution | VOL. 4

Geographic sampling bias in the South African Frog Atlas Project: implications for conservation planning
Emily A Botts ... Graham J Alexander
Biodiversity and Conservation | VOL. 20
Emily A Botts, et. al.Emily A Botts ... Graham J Alexander
05 Dec 2010
Biodiversity and Conservation | VOL. 20

An Interactive, Online Web Map Resource of Global Fusarium oxysporum ff. spp. Diversity and Distribution.
Rocío Calderón ... Kaitlin M Gold
Plant disease | VOL. 107
Rocío Calderón, et. al.Rocío Calderón ... Kaitlin M Gold
31 Dec 2022
Plant disease | VOL. 107

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Sampbias, a method for quantifying geographic sampling biases in species distribution data

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Ecography