Abstract

Positive and Unlabeled Learning (PUL) uses unlabeled documents and a few positive documents to retrieve a set of documents of interest from a text collection. PUL approaches are usually based on the vector space model. However, for semi-supervised text classification and information retrieval, graph-based approaches have been shown to outperform approaches based on the vector space model. Therefore, this article proposes a graph-based approach for PUL: Label Propagation for Positive and Unlabeled Learning (LP-PUL). The proposed framework consists of three steps: (i) building a similarity graph, (ii) identifying reliable negative documents, and (iii) performing label propagation to classify the remaining unlabeled documents as positive or negative. We carried out experiments to measure the impact of the different choices in each step of the framework. We also demonstrated that the proposal surpasses the classification performance of other PUL algorithms (RC-SVM, PU-LP, and PE-PUC) and one-class learning algorithms (k-NN-based, k-Means-based, and Dense Autoencoder) in terms of F1. Considering the best results achieved by any algorithm in the experimental evaluation, LP-PUL improves classification performance by 2% when using only 1 labeled document and by up to 28% when 30 labeled documents are employed.
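The three-step framework described above can be illustrated with a minimal sketch. The specific choices below (TF-IDF features, a cosine-similarity graph, selecting reliable negatives as the unlabeled documents least similar to the positives, and scikit-learn's LabelSpreading for propagation) are assumptions for illustration, not the exact configuration evaluated in the article.

```python
# Minimal sketch of a three-step LP-PUL-style pipeline (illustrative assumptions).
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity
from sklearn.semi_supervised import LabelSpreading

def lp_pul_sketch(docs, positive_idx, n_reliable_neg=10, k=15):
    # Represent documents; the vector space is used only to build the graph.
    X = TfidfVectorizer().fit_transform(docs)

    # Step (i): build a similarity graph from pairwise cosine similarities.
    sim = cosine_similarity(X)
    np.fill_diagonal(sim, 0.0)

    # Step (ii): pick reliable negatives as the unlabeled documents with the
    # lowest average similarity to the labeled positive documents.
    pos = np.asarray(positive_idx)
    unlabeled = np.setdiff1d(np.arange(len(docs)), pos)
    avg_sim_to_pos = sim[unlabeled][:, pos].mean(axis=1)
    reliable_neg = unlabeled[np.argsort(avg_sim_to_pos)[:n_reliable_neg]]

    # Step (iii): propagate labels over a k-NN graph; -1 marks unlabeled nodes.
    y = np.full(len(docs), -1)
    y[pos] = 1
    y[reliable_neg] = 0
    model = LabelSpreading(kernel="knn", n_neighbors=k)
    model.fit(X.toarray(), y)
    return model.transduction_  # 1 = positive, 0 = negative
```

In practice, each step admits different choices (graph construction, reliable-negative heuristic, propagation algorithm), which is exactly what the experiments in the article vary.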
