Abstract
Active learning algorithms propose which data should be labeled given a pool of unlabeled data. Instead of selecting data to annotate at random, active learning strategies aim to select data so as to obtain a good predictive model with as few labeled samples as possible. Single-shot batch active learners select all samples to be labeled in a single step, before any labels are observed. We study single-shot active learners that minimize generalization bounds to select a representative sample, such as the maximum mean discrepancy (MMD) active learner. We prove that a related bound, the discrepancy, provides a tighter worst-case bound. We study these bounds probabilistically, which inspires us to introduce a novel bound, the nuclear discrepancy (ND). The ND bound is tighter for the expected loss under optimistic probabilistic assumptions. Our experiments show that the MMD active learner performs better than the discrepancy in terms of the mean squared error, indicating that tighter worst-case bounds do not imply better active learning performance. The proposed active learner improves significantly upon the MMD and discrepancy in the realizable setting, and a similar trend is observed in the agnostic setting, showing the benefits of a probabilistic approach to active learning. Our study highlights that the assumptions underlying generalization bounds can be just as important as bound tightness when it comes to active learning performance. Code for reproducing our experimental results can be found at https://github.com/tomviering/NuclearDiscrepancy.
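For intuition, the sketch below shows how an MMD-style single-shot batch selection could look in practice: it greedily picks pool points so that the selected batch's empirical distribution has a small squared MMD to the full unlabeled pool under an RBF kernel. This is a minimal illustrative sketch, not the implementation behind our experiments (see the repository linked above); the function names, the RBF kernel choice, and the greedy forward-selection strategy are assumptions made here for illustration.

```python
import numpy as np

def rbf_kernel(X, Y, gamma=1.0):
    # Pairwise RBF kernel matrix between rows of X and rows of Y.
    d2 = np.sum(X**2, axis=1)[:, None] + np.sum(Y**2, axis=1)[None, :] - 2 * X @ Y.T
    return np.exp(-gamma * d2)

def mmd_greedy_batch(X_pool, budget, gamma=1.0):
    # Greedily select `budget` pool indices whose empirical distribution has
    # small squared MMD to the full unlabeled pool (single-shot batch selection).
    K = rbf_kernel(X_pool, X_pool, gamma)   # kernel matrix over the pool
    n = len(X_pool)
    pool_mean = K.mean(axis=1)              # average kernel value to the pool
    selected = []
    for _ in range(budget):
        best_idx, best_val = None, np.inf
        for i in range(n):
            if i in selected:
                continue
            cand = selected + [i]
            # Squared MMD between candidate batch and pool, dropping the
            # constant pool-pool term that does not affect the argmin.
            mmd2 = K[np.ix_(cand, cand)].mean() - 2.0 * pool_mean[cand].mean()
            if mmd2 < best_val:
                best_val, best_idx = mmd2, i
        selected.append(best_idx)
    return selected

# Example usage: pick 20 representative points from an unlabeled pool.
# X_unlabeled = np.random.randn(500, 10)
# batch_indices = mmd_greedy_batch(X_unlabeled, budget=20, gamma=0.5)
```

The greedy loop is only one common way to approximately minimize a set-based criterion such as the MMD; the optimization and kernel choice used in the paper may differ.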
Highlights
Supervised machine learning models require enough labeled data to obtain good generalization performance
The Nuclear Discrepancy (ND) bound provides the tightest bound on the expected loss under probabilistic assumptions that follow from the principle of maximum entropy
For the maximum mean discrepancy (MMD) active learner, studied by Chattopadhyay et al. (2012) and Wang and Ye (2013), we give new theoretical results: an improved bound for active learning and a principled way to choose the kernel for the MMD
Summary
Supervised machine learning models require enough labeled data to obtain good generalization performance. For many practical applications, such as medical diagnosis or video topic prediction, labeling data can be expensive or time consuming (Settles 2012). In these settings, unlabeled data is often abundant. In active learning, an algorithm chooses unlabeled samples for labeling (Cohn et al. 1994). The idea is that models can perform better with fewer labeled samples if the labeled data is chosen carefully instead of randomly. Active learning makes the most of a small labeling budget and can reduce labeling costs.