Abstract

Crowdsourcing platforms provide a convenient and scalable way to collect human-generated labels on demand. These data can be used to train Artificial Intelligence (AI) systems or to evaluate the effectiveness of algorithms. The datasets generated by means of crowdsourcing are, however, dependent on many factors that affect their quality. These include, among others, the population sample bias introduced by factors such as task reward, requester reputation, and other filters applied in the task design. In this paper, we analyse platform-related factors and study how they affect dataset characteristics by running a longitudinal study in which we compare the reliability of results collected with repeated experiments over time and across crowdsourcing platforms. Results show that, under certain conditions: 1) experiments replicated across different platforms yield significantly different data quality levels, while 2) the quality of data from experiments repeated over time within the same platform remains stable. We identify key task design variables that cause such variations and propose an experimentally validated set of actions to counteract these effects, thus achieving reliable and repeatable crowdsourced data collection experiments.
