A new self-paced learning method for privilege-based positive and unlabeled learning

Bo Liu,Junrui Liu,Yanshan Xiao,Qihang Chen,Kai Wang,Ruiguang Huang,Liangjiao Li

doi:10.1016/j.ins.2022.07.143

Abstract

Positive and unlabeled learning (PU learning) is a kind of problem whose goal is learning a two-classes classifier with little proportion of positive samples and numerous unlabeled samples. A series of studies focus on how to extract most likely negative samples from the unlabeled samples, and then train a classifier with the labeled samples as supervised learning. Previous PU learning methods always ignore the additional information called privileged information, which is just provided during the training process while unavailable during testing. In this paper, we propose a novel self-paced algorithm for PU learning with privileged information (SPUPI). The proposed SPUPI extracts some reliable negative samples from unlabeled samples at first, and then generates weights for the unlabeled samples according to the similarity with each class. After that, it builds a more accurate classifier based on privileged information and similarity weights by self-paced learning. By taking the self-paced learning into training, we can build the model with a few labeled samples from easy to complex. We also solve the problem by transforming the primal problem of the proposed model into its dual problem and achieving the PU classifier. Various experiments on the practical datasets indicate that the SPUPI has a better performance compared with previous methods.

Full Text