Abstract

Citizen Cyberscience Projects (CCPs) that recruit members of the public as volunteers to process and produce large datasets promise a great deal of benefits to scientists and science. However, if this promise is to be realised, and citizen science-produced datasets are to be widely used by scientists, it is essential that these datasets win the trust of the scientific community. This task of securing credibility involves, in part, applying standard scientific procedures to clean up datasets formed by volunteer contributions. However, the management of volunteers’ behaviour in terms of how they contribute also plays a significant role in improving both the quality of individual contributions and the overall robustness of the resultant datasets. This can assist CCPs in securing a reputation for producing trustworthy datasets. Through a case study of Galaxy Zoo, a CCP set up to generate datasets based on volunteer classifications of galaxy morphologies, this paper explores how those involved in running the project manage volunteers. In particular, it focuses on how methods for crediting volunteer contributions motivate volunteers to provide higher quality contributions and to behave in a way that better corresponds to statistical assumptions made when combining volunteer contributions into datasets. These methods have made a significant contribution to the success of the project in securing trust in these datasets, which have been well used by other scientists. Implications for practice are then presented for CCPs, providing a list of considerations to guide choices regarding how to credit volunteer contributions to improve the quality and trustworthiness of citizen science-produced datasets.

Highlights

  • Recent decades have seen an explosion of scientific sub-disciplines and projects which involve the processing of large quantities of data (Hey, Tansley, & Tolle, 2009; Welsh, Jirotka, & Gavaghan, 2006)

  • It is vital for those who run a Citizen Cyberscience Project to consider how to secure credibility for the datasets being produced by their own Cyberscience Projects (CCPs)

  • This paper has considered how methods of crediting volunteers in the Galaxy Zoo project have played a significant role in volunteer management to improve reliability of Galaxy Zoo datasets

Read more

Summary

Introduction

Recent decades have seen an explosion of scientific sub-disciplines and projects which involve the processing of large quantities of data (Hey, Tansley, & Tolle, 2009; Welsh, Jirotka, & Gavaghan, 2006). Another critical challenge that has recently faced many scientists is a political culture which increasingly calls on scientists to show a willingness to engage the lay public with their work (Wynne & Felt, 2007). Another group of CCPs that all use the same piece of middleware called the Berkeley Open Infrastructure for Networked Computing (BOINC2) involves nearly a quarter of a million currently active volunteers

Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.