Two-step learning for crowdsourcing data classification.

Hao Yu,Jiaye Li,Zhaojiang Wu,Hang Xu,Lei Zhu

doi:10.1007/s11042-022-12793-4

Abstract

Crowdsourcing learning (Bonald and Combes 2016; Dawid and Skene, J R Stat Soc: Series C (Appl Stat), 28(1):20–28 1979; Karger et al. 2011; Li et al, IEEE Trans Knowl Data Eng, 28(9):2296–2319 2016; Liu et al. 2012; Schlagwein and Bjorn-Andersen, J Assoc Inform Syst, 15(11):3 2014; Zhang et al. 2014) plays an increasingly important role in the era of big data (Liu et al., IEEE Trans Syst Man Cybern: Syst, 48(12): 451–2461, 2017; Zhang et al. 2014) due to its ability to easily solve large-scale data annotations (Musen et al., J Amer Med Informs Assoc, 22(6):1148–1152 2015). However, in the process of crowdsourcing learning, the uneven knowledge level of workers often leads to low accuracy of the label after marking, which brings difficulties to the subsequent processing (Edwards and Teddy 2013) and analysis of crowdsourcing data. In order to solve this problem, this paper proposes a two-step learning crowdsourced data classification algorithm, which optimizes the original label data by simultaneously considering the two issues of different worker abilities and the similarity between crowdsourced data (Kasikci et al. 2013) samples, so as to get more accurate label data. The two-step learning algorithm mainly includes two steps. Firstly, the worker’s ability to label different samples is obtained by constructing and training the worker’s ability model, and then the similarity between samples is calculated by the cosine measurement method (Muflikhah and Baharudin 2009), and finally the original label data is optimized by combining the above two results. The experimental results also show that the two-step learning classification algorithm proposed in this article has achieved better experimental results than the comparison algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Multimedia tools and applications	Publication Date: Jul 9, 2022
Citations: 3	License type: NO-CC CODE

R Discovery Prime

R Discovery Prime

Two-step learning for crowdsourcing data classification.

Abstract

Talk to us

Similar Papers

More From: Multimedia tools and applications

Lead the way for us

Similar Papers

An RBF network with a two-step learning algorithm for developing a reservoir inflow forecasting model
Gwo-Fong Lin ... Ming-Chang Wu
Journal of Hydrology | VOL. 405
Gwo-Fong Lin, et. al.Gwo-Fong Lin ... Ming-Chang Wu
31 May 2011
Journal of Hydrology | VOL. 405

A Two-Step Learning Approach for Solving Full and Almost Full Cold Start Problems in Dyadic Prediction
Tapio Pahikkala ... Bernard De Baets
-
Tapio Pahikkala, et. al.Tapio Pahikkala ... Bernard De Baets
01 Jan 2014
01 Jan 2014

Two-step machine learning method for the rapid analysis of microvascular flow in intravital video microscopy
Ossama Mahmoud ... Mahmoud El-Sakka
Scientific Reports | VOL. 11
Ossama Mahmoud, et. al.Ossama Mahmoud ... Mahmoud El-Sakka
11 May 2021
Scientific Reports | VOL. 11

Two‐step Multivariate Classification of the Mechanisms of Toxic Action of Phenols
Shijin Ren
QSAR & Combinatorial Science | VOL. 22
Shijin RenShijin Ren
01 Aug 2003
QSAR & Combinatorial Science | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Two-step learning for crowdsourcing data classification.

Abstract

Talk to us

Similar Papers

More From: Multimedia tools and applications