Abstract
Although quality control for accuracy is increasingly common in citizen science projects, there is still a risk that spatial biases of opportunistic data could affect results, especially if sample size is low. Here we evaluate how well the sampling locations of North Carolina’s Candid Critters citizen science camera trapping project represented available land cover types in the state and whether the sample size (4,295 sites) was sufficient to estimate ecological parameters (i.e., species occupancy) with low bias and error. Although most sampling was opportunistic, we used a “Plan, Encourage, Supplement” approach to improve our spatial coverage. We assessed potential biases by comparing seven dimensions of habitat (i.e., land cover, elevation, road density, etc.) sampled by camera traps with those available in the state, using a minimum sample threshold approach, and found that the variation of habitat across the state was sufficiently sampled. At the ecoregion level we sampled 99.2% (±0.01) of the variation of potential habitat “adequately” and 96.4% (±0.03) “very adequately.” Supplemental sampling by staff helped meet sampling adequacy for 6.8% of ecoregion-habitat classes, especially in less populated parts of the state. Compared with results from the full data set, the relative bias and error with subsets of the data dropped below 10% relatively quickly with increasing sample size for estimates of occupancy, suggesting that results estimated with the full sample are robust, although the precision of particular ecological relationships were more variable. These analyses show that opportunistic sampling can be representative of large areas if sample size is high enough and that a priori sampling goals can help improve coverage by encouraging volunteers to sample in certain places or through supplemental data collection by staff.
Highlights
Data quality is central to citizen science projects producing trustworthy scientific outcomes
Because ecological relationships might be different across the state, we considered how many camera traps fit into these categories separately in the three major ecoregions of the state
We recorded the estimated occupancy probabilities from the subsampled data, and after the 20 random samples, we summarized how those estimates compared with the full data set using relative bias (RBIAS) and relative root mean square error (RRMSE): RRMSE
Summary
Data quality is central to citizen science projects producing trustworthy scientific outcomes. As the citizen science approach became more popular, some scientists voiced concerns about the potential for volunteers to produce data sets without large amounts of error (Dickinson, Zuckerberg, and Bonter 2010) that would be able to detect changes in the population status of wild species (Danielsen et al 2014). In response to those challenges, many best practices in citizen science feature mechanisms to assess the quality of data. Both of these studies highlight the limitations of post-hoc corrections, emphasizing the importance of reducing bias at the point of data collection
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have