A unified statistical framework for crowd labeling

Jafar Muhammadi,Abbas Hosseini,Hamid R Rabiee

doi:10.1007/s10115-014-0790-7

Abstract

Recently, there has been a burst in the number of research projects on human computation via crowdsourcing. Multiple-choice (or labeling) questions could be referred to as a common type of problem which is solved by this approach. As an application, crowd labeling is applied to find true labels for large machine learning datasets. Since crowds are not necessarily experts, the labels they provide are rather noisy and erroneous. This challenge is usually resolved by collecting multiple labels for each sample and then aggregating them to estimate the true label. Although the mechanism leads to high-quality labels, it is not actually cost-effective. As a result, efforts are currently made to maximize the accuracy in estimating true labels, while fixing the number of acquired labels. This paper surveys methods to aggregate redundant crowd labels in order to estimate unknown true labels. It presents a unified statistical latent model where the differences among popular methods in the field correspond to different choices for the parameters of the model. Afterward, algorithms to make inference on these models will be surveyed. Moreover, adaptive methods which iteratively collect labels based on the previously collected labels and estimated models will be discussed. In addition, this paper compares the distinguished methods and provides guidelines for future work required to address the current open issues.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A unified statistical framework for crowd labeling

Abstract

Talk to us

Similar Papers

More From: Knowledge and Information Systems

Lead the way for us

Journal: Knowledge and Information Systems	Publication Date: Oct 5, 2014
Citations: 23

Similar Papers

Joint Annotator-and-Spectrum Allocation in Wireless Networks for Crowd Labeling
Xiaoyang Li ... Yi Gong
IEEE Transactions on Wireless Communications | VOL. 19
Xiaoyang Li, et. al.Xiaoyang Li ... Yi Gong
02 Jun 2020
IEEE Transactions on Wireless Communications | VOL. 19

Joint Generative-Discriminative Aggregation Model for Multi-Option Crowd Labels
Kamran Ghasedi Dizaji ... Heng Huang
-
Kamran Ghasedi Dizaji, et. al.Kamran Ghasedi Dizaji ... Heng Huang
02 Feb 2018
02 Feb 2018

Predicting Worker Disagreement for More Effective Crowd Labeling
Stefan Rabiger ... Myra Spiliopoulou
-
Stefan Rabiger, et. al.Stefan Rabiger ... Myra Spiliopoulou
01 Oct 2018
01 Oct 2018

Comparison of variational transition state theory and the unified statistical model with vibrationally adiabatic transmission coefficients to accurate collinear rate constants for T+HD→TH+D
Bruce C Garrett ... Roger S Grev
The Journal of Chemical Physics | VOL. 73
Bruce C Garrett, et. al.Bruce C Garrett ... Roger S Grev
01 Jul 1980
The Journal of Chemical Physics | VOL. 73

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A unified statistical framework for crowd labeling

Abstract

Talk to us

Similar Papers

More From: Knowledge and Information Systems