Representation Learning from Limited Educational Data with Crowdsourced Labels

Wentao Wang,Guowei Xu,Zitao Liu,Guoliang Li,Yan Huang,Jiliang Tang,Wenbiao Ding

doi:10.1109/tkde.2020.3017122

Abstract

Representation learning has been proven to play an important role in the unprecedented success of machine learning models in numerous tasks, such as machine translation, face recognition and recommendation. The majority of existing representation learning approaches often require large amounts of consistent and noise-free labels. However, labels are very limited in many real-world scenarios. Directly applying standard representation learning approaches on small labeled data sets will easily run into over-fitting problems and lead to sub-optimal solutions. Even worse, the limited labels are usually annotated by multiple workers with diverse expertise, which yields noises and inconsistency in such crowdsourced labels. In this paper, we propose a novel framework which aims to learn effective representations from limited data with crowdsourced labels. We design a grouping based deep neural network to learn embeddings from limited training samples and present a Bayesian confidence estimator to capture the inconsistency among crowdsourced labels. Furthermore, we develop a hard example selection procedure to adaptively pick up training examples that are being misclassified by the current version of the model. Extensive experiments conducted on three real-world educational data sets demonstrate the superiority of our framework on learning representations from limited data with crowdsourced labels.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Representation Learning from Limited Educational Data with Crowdsourced Labels

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering

Lead the way for us

Journal: IEEE Transactions on Knowledge and Data Engineering	Publication Date: Jan 1, 2020
Citations: 8

Similar Papers

Learning Effective Embeddings From Crowdsourced Labels: An Educational Case Study
Guowei Xu ... Wenbiao Ding
-
Guowei Xu, et. al.Guowei Xu ... Wenbiao Ding
01 Apr 2019
01 Apr 2019

NeuCrowd: neural sampling network for representation learning with crowdsourced labels
Yang Hao ... Zitao Liu
Knowledge and Information Systems | VOL. 64
Yang Hao, et. al.Yang Hao ... Zitao Liu
20 Feb 2022
Knowledge and Information Systems | VOL. 64

Temporal-aware Language Representation Learning From Crowdsourced Labels
Yang Hao ... Xiao Zhai
-
Yang Hao, et. al.Yang Hao ... Xiao Zhai
01 Jan 2020
01 Jan 2020

Regularizing the loss layer of CNNs for facial expression recognition using crowdsourced labels
Philip Lu ... Saila Shama
-
Philip Lu, et. al.Philip Lu ... Saila Shama
01 Nov 2017
01 Nov 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Representation Learning from Limited Educational Data with Crowdsourced Labels

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Knowledge and Data Engineering