Abstract

Crowdsourcing has become an alternative approach to collecting relevance judgments at scale, thanks to the availability of crowdsourcing platforms and quality control techniques that make it possible to obtain reliable results. Previous work has used crowdsourcing to ask multiple crowd workers to judge the relevance of a document with respect to a query and has studied how best to aggregate multiple judgments of the same topic-document pair. This paper addresses an aspect that has been rather overlooked so far: we study how the time available to express a relevance judgment affects its quality. We also discuss the quality loss incurred by making crowdsourced relevance judgments more efficient in terms of the time taken to judge a document. We use standard test collections to run a battery of experiments on the crowdsourcing platform CrowdFlower, studying how much time crowd workers need to judge the relevance of a document and what effect reducing the available judging time has on the overall quality of the judgments. Our extensive experiments compare judgments obtained under different types of time constraints with judgments obtained when no time constraint was placed on the task. We measure judgment quality using several measures of agreement with editorial judgments. Experimental results show that it is possible to reduce the cost of creating crowdsourced evaluation collections by reducing the time available to perform the judgments, with no loss in quality. Most importantly, we observed that introducing limits on the time available to perform the judgments improves the overall judgment quality. The highest judgment quality is obtained when workers are given 25-30 seconds to judge a topic-document pair.
