Leveraging Side Information to Improve Label Quality Control in Crowd-Sourcing

Yuan Jin,Mark Carman,Dongwoo Kim,Lexing Xie

doi:10.1609/hcomp.v5i1.13315

Abstract

We investigate the possibility of leveraging side information for improving quality control over crowd-sourced data. We extend the GLAD model, which governs the probability of correct labeling through a logistic function in which worker expertise counteracts item difficulty, by systematically encod- ing different types of side information, including worker in- formation drawn from demographics and personality traits, item information drawn from item genres and content, and contextual information drawn from worker responses and la- beling sessions. Modeling side information allows for better estimation of worker expertise and item difficulty in sparse data situations and accounts for worker biases, leading to bet- ter prediction of posterior true label probabilities. We demon- strate the efficacy of the proposed framework with overall improvements in both the true label prediction and the un- seen worker response prediction based on different combina- tions of the various types of side information across three new crowd-sourcing datasets. In addition, we show the framework exhibits potential of identifying salient side information fea- tures for predicting the correctness of responses without the need of knowing any true label information.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Leveraging Side Information to Improve Label Quality Control in Crowd-Sourcing

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing

Lead the way for us

Journal: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing	Publication Date: Sep 21, 2017
Citations: 5

Similar Papers

Exploiting side information in distance dependent Chinese restaurant processes for data clustering
Cheng Li ... Dinh Phung
-
Cheng Li, et. al. Cheng Li ... Dinh Phung
01 Jul 2013
01 Jul 2013

Learning Dynamical Systems with Side Information
Amir Ali Ahmadi ... Bachir El Khadir
SIAM Review | VOL. 65
Amir Ali Ahmadi, et. al.Amir Ali Ahmadi ... Bachir El Khadir
01 Feb 2023
SIAM Review | VOL. 65

Optimal Power Management for Remote Estimation With an Energy Harvesting Sensor
Yu Zhao ... Biao Chen
IEEE Transactions on Wireless Communications | VOL. 14
Yu Zhao, et. al.Yu Zhao ... Biao Chen
01 Nov 2015
IEEE Transactions on Wireless Communications | VOL. 14

Kernelized Probabilistic Matrix Factorization: Exploiting Graphs and Side Information
Tinghui Zhou ... Guillermo Sapiro
-
Tinghui Zhou, et. al.Tinghui Zhou ... Guillermo Sapiro
26 Apr 2012
26 Apr 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Leveraging Side Information to Improve Label Quality Control in Crowd-Sourcing

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing