Evaluation of the Spatial Biases and Sample Size of a Statewide Citizen Science Project

Roland Kays,Monica Lasky,Brent Pease,Arielle W Parsons,Krishna Pacifici

doi:10.5334/cstp.344

Roland Kays, Monica Lasky + Show 3 more

Open Access

PDF Available

https://doi.org/10.5334/cstp.344

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

Although quality control for accuracy is increasingly common in citizen science projects, there is still a risk that spatial biases of opportunistic data could affect results, especially if sample size is low. Here we evaluate how well the sampling locations of North Carolina’s Candid Critters citizen science camera trapping project represented available land cover types in the state and whether the sample size (4,295 sites) was sufficient to estimate ecological parameters (i.e., species occupancy) with low bias and error. Although most sampling was opportunistic, we used a “Plan, Encourage, Supplement” approach to improve our spatial coverage. We assessed potential biases by comparing seven dimensions of habitat (i.e., land cover, elevation, road density, etc.) sampled by camera traps with those available in the state, using a minimum sample threshold approach, and found that the variation of habitat across the state was sufficiently sampled. At the ecoregion level we sampled 99.2% (±0.01) of the variation of potential habitat “adequately” and 96.4% (±0.03) “very adequately.” Supplemental sampling by staff helped meet sampling adequacy for 6.8% of ecoregion-habitat classes, especially in less populated parts of the state. Compared with results from the full data set, the relative bias and error with subsets of the data dropped below 10% relatively quickly with increasing sample size for estimates of occupancy, suggesting that results estimated with the full sample are robust, although the precision of particular ecological relationships were more variable. These analyses show that opportunistic sampling can be representative of large areas if sample size is high enough and that a priori sampling goals can help improve coverage by encouraging volunteers to sample in certain places or through supplemental data collection by staff.

Highlights

Data quality is central to citizen science projects producing trustworthy scientific outcomes
Because ecological relationships might be different across the state, we considered how many camera traps fit into these categories separately in the three major ecoregions of the state
We recorded the estimated occupancy probabilities from the subsampled data, and after the 20 random samples, we summarized how those estimates compared with the full data set using relative bias (RBIAS) and relative root mean square error (RRMSE): RRMSE

Summary

Introduction

Data quality is central to citizen science projects producing trustworthy scientific outcomes. As the citizen science approach became more popular, some scientists voiced concerns about the potential for volunteers to produce data sets without large amounts of error (Dickinson, Zuckerberg, and Bonter 2010) that would be able to detect changes in the population status of wild species (Danielsen et al 2014). In response to those challenges, many best practices in citizen science feature mechanisms to assess the quality of data. Both of these studies highlight the limitations of post-hoc corrections, emphasizing the importance of reducing bias at the point of data collection

Methods

Results

Conclusion