A bagging SVM to learn from positive and unlabeled examples

F Mordelet,J.-P Vert

doi:10.1016/j.patrec.2013.06.010

Abstract

We consider the problem of learning a binary classifier from a training set of positive and unlabeled examples, both in the inductive and in the transductive setting. This problem, often referred to as PU learning, differs from the standard supervised classification problem by the lack of negative examples in the training set. It corresponds to an ubiquitous situation in many applications such as information retrieval or gene ranking, when we have identified a set of data of interest sharing a particular property, and we wish to automatically retrieve additional data sharing the same property among a large and easily available pool of unlabeled data. We propose a new method for PU learning with a conceptually simple implementation based on bootstrap aggregating (bagging) techniques: the algorithm iteratively trains many binary classifiers to discriminate the known positive examples from random subsamples of the unlabeled set, and averages their predictions. We show theoretically and experimentally that the method can match and even outperform the performance of state-of-the-art methods for PU learning, particularly when the number of positive examples is limited and the fraction of negatives among the unlabeled examples is small. The proposed method can also run considerably faster than state-of-the-art methods, particularly when the set of unlabeled examples is large.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A bagging SVM to learn from positive and unlabeled examples

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Jun 25, 2013
Citations: 258

Similar Papers

Cool Blog Classification from Positive and Unlabeled Examples
Kritsada Sriphaew ... Hiroya Takamura
-
Kritsada Sriphaew, et. al.Kritsada Sriphaew ... Hiroya Takamura
01 Jan 2009
01 Jan 2009

Building a Biased Least Squares Support Vector Machine Classifier for Positive and Unlabeled Learning
Ting Ke ... Xinbin Zhao
Journal of Software | VOL. -
Ting Ke, et. al.Ting Ke ... Xinbin Zhao
06 Jan 2014
Journal of Software | VOL. -

Learning classifiers from only positive and unlabeled data
Charles Elkan ... Keith Noto
-
Charles Elkan, et. al.Charles Elkan ... Keith Noto
24 Aug 2008
24 Aug 2008

Text classification without negative examples revisit
Gabriel Pui Cheong Fung ... J.X Yu
IEEE Transactions on Knowledge and Data Engineering | VOL. 18
Gabriel Pui Cheong Fung, et. al. Gabriel Pui Cheong Fung ... J.X Yu
01 Jan 2006
IEEE Transactions on Knowledge and Data Engineering | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A bagging SVM to learn from positive and unlabeled examples

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters