Inventory statistics meet big data: complications for estimating numbers of species

Ali Khalighifar, Kate Ingenloff, Benedictus Freeman, Laura Jiménez, Town Peterson, Daniel Jiménez-García, Claudia Nuñez-Penichet

doi:10.7717/peerj.8872

Abstract

We point out complications inherent in biodiversity inventory metrics when applied to large-scale datasets. The number of units of inventory effort (e.g., days of inventory effort) in which a species is detected saturates, such that crucial numbers of detections of rare species approach zero. Any rare errors can then come to dominate species richness estimates, creating upward biases in estimates of species numbers. We document the problem via simulations of sampling from virtual biotas, illustrate its potential using a large empirical dataset (bird records from Cape May, NJ, USA), and outline the circumstances under which these problems may be expected to emerge.
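The saturation argument can be made concrete with a worked equation. The sketch below uses the bias-corrected form of Chao2, the estimator named in the Highlights; whether the authors used this exact variant is an assumption, and the symbols $m$, $Q_1$, $Q_2$, $S$, and $E$ are introduced here for illustration only.

```latex
% m   = number of units of inventory effort (e.g., days)
% Q_1 = number of species detected in exactly one unit ("uniques")
% Q_2 = number of species detected in exactly two units ("duplicates")
\hat{S}_{\mathrm{Chao2}} = S_{\mathrm{obs}} + \frac{m-1}{m}\,\frac{Q_1\,(Q_1-1)}{2\,(Q_2+1)}

% Saturated, error-free inventory of S true species:
% every species is detected in many units, so Q_1 = Q_2 = 0 and
\hat{S}_{\mathrm{Chao2}} = S_{\mathrm{obs}} = S

% Same inventory with E erroneous species, each recorded in a single unit:
% Q_1 = E, Q_2 = 0, S_obs = S + E, and for large m
\hat{S}_{\mathrm{Chao2}} \approx S + E + \frac{E\,(E-1)}{2}
```

Under these assumptions, ten erroneous single-unit records ($E = 10$) would raise the estimate by roughly $10 + 45 = 55$ species above the true richness, because the errors are the only uniques left in the dataset.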

Highlights

Biodiversity measurements have important implications for conservation efforts (Sousa-Baena, Garcia & Peterson, 2014).
Errors in the data, even at very low frequencies, can come to dominate the estimation process with common, long-used nonparametric estimators such as Chao2. The older species accumulation curve approach would clearly overestimate species numbers as well, because “error” species appear as species documented in the inventory.
Biodiversity inventory statistics are important because they add crucial information to the process of biotic inventories. It is therefore important either to amend these approaches so that they are less vulnerable to bias or, at minimum, to remain aware of the potential for estimation problems in big(ger) datasets.
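The effect described in these highlights can be explored with a small simulation. The following is a minimal sketch, not the authors' simulation design: the function names, detection probabilities, error rate, and size of the pool of possible erroneous species are all hypothetical, and the bias-corrected Chao2 formula is assumed as the estimator.

```python
import random


def chao2(incidence_counts, m):
    """Bias-corrected Chao2 (assumed variant).

    incidence_counts: per-species number of sampling units with a detection.
    m: total number of sampling units (e.g., days of inventory effort).
    """
    s_obs = sum(1 for c in incidence_counts if c > 0)  # species observed
    q1 = sum(1 for c in incidence_counts if c == 1)    # uniques
    q2 = sum(1 for c in incidence_counts if c == 2)    # duplicates
    return s_obs + ((m - 1) / m) * q1 * (q1 - 1) / (2 * (q2 + 1))


def simulate_inventory(n_units, detect_probs, error_rate, n_error_pool, rng):
    """Sample a virtual biota over n_units sampling units.

    detect_probs: hypothetical per-unit detection probability of each true species.
    error_rate: hypothetical per-unit probability of one erroneous record.
    n_error_pool: hypothetical number of species that could be recorded in error.
    Returns incidence counts for true species followed by error species.
    """
    true_counts = [0] * len(detect_probs)
    error_counts = [0] * n_error_pool
    for _ in range(n_units):
        for i, p in enumerate(detect_probs):
            if rng.random() < p:
                true_counts[i] += 1
        if rng.random() < error_rate:
            # An erroneous record adds a species that is not in the biota.
            error_counts[rng.randrange(n_error_pool)] += 1
    return true_counts + error_counts


if __name__ == "__main__":
    rng = random.Random(42)
    probs = [0.05 + 0.04 * i for i in range(20)]  # 20 true species
    for rate in (0.0, 0.001, 0.01):
        counts = simulate_inventory(2000, probs, rate, 1000, rng)
        observed = sum(1 for c in counts if c > 0)
        print(f"error rate {rate}: observed {observed}, Chao2 {chao2(counts, 2000):.1f}")
```

With many sampling units, every true species is detected far more than twice, so the only uniques left are the erroneous records, and the correction term grows roughly with the square of their number.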

Summary

Introduction

Biodiversity measurements have important implications for conservation efforts (Sousa-Baena, Garcia & Peterson, 2014). Biodiversity metrics provide information about community composition, numbers of species, and similarity or dissimilarity of species composition among sites (Colwell & Coddington, 1994), and can allow researchers to separate well-inventoried sites from partially inventoried sites for macroecological analyses (Lobo et al., 2018). Tracking species richness in biodiversity inventories was originally achieved via visual assessment of the asymptotic behavior of species accumulation curves (Karr, 1980), with quantitative assistance from non-linear regressions (Clench, 1979; Soberón & Llorente, 1993). For more than 20 years, species richness has instead been estimated with non-parametric estimators of numbers of species, a set of estimators based on sampling theory (Chao, 1987). Diverse data origins and variable data quality pose significant challenges for such analyses, when data are drawn from publicly

Journal: PeerJ
Publication Date: May 13, 2020
Citations: 3
License type: CC BY 4.0
