Selective Sampling for Nearest Neighbor Classifiers

Michael Lindenbaum,Shaul Markovitch,Dmitry Rusakov

doi:10.1023/b:mach.0000011805.60520.fe

Abstract

Most existing inductive learning algorithms work under the assumption that their training examples are already tagged. There are domains, however, where the tagging procedure requires significant computation resources or manual labor. In such cases, it may be beneficial for the learner to be active, intelligently selecting the examples for labeling with the goal of reducing the labeling cost. In this paper we present LSS—a lookahead algorithm for selective sampling of examples for nearest neighbor classifiers. The algorithm is looking for the example with the highest utility, taking its effect on the resulting classifier into account. Computing the expected utility of an example requires estimating the probability of its possible labels. We propose to use the random field model for this estimation. The LSS algorithm was evaluated empirically on seven real and artificial data sets, and its performance was compared to other selective sampling algorithms. The experiments show that the proposed algorithm outperforms other methods in terms of average error rate and stability.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Selective Sampling for Nearest Neighbor Classifiers

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Journal: Machine Learning	Publication Date: Feb 1, 2004
Citations: 213

Similar Papers

Selective sampling for trees and forests
Murad Badarna ... Ilan Shimshoni
Neurocomputing | VOL. 358
Murad Badarna, et. al.Murad Badarna ... Ilan Shimshoni
10 May 2019
Neurocomputing | VOL. 358

Active learning: theory and applications to automatic speech recognition
G Riccardi ... D Hakkani-Tur
IEEE Transactions on Speech and Audio Processing | VOL. 13
G Riccardi, et. al.G Riccardi ... D Hakkani-Tur
01 Jul 2005
IEEE Transactions on Speech and Audio Processing | VOL. 13

Author response: Limitations of principal components in quantitative genetic association models for human studies
Yiqi Yao ... Alejandro Ochoa
-
Yiqi Yao, et. al.Yiqi Yao ... Alejandro Ochoa
25 Apr 2023
25 Apr 2023

Decision letter: Limitations of principal components in quantitative genetic association models for human studies
Magnus Nordborg ... Detlef Weigel
-
Magnus Nordborg, et. al.Magnus Nordborg ... Detlef Weigel
04 Jul 2022
04 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Selective Sampling for Nearest Neighbor Classifiers

Abstract

Talk to us

Similar Papers

More From: Machine Learning