Abstract
In many applications, data is easy to acquire but expensive and time-consuming to label; prominent examples include medical imaging and NLP. This disparity has only grown in recent years as our ability to collect data improves. Under these constraints, it makes sense to select only the most informative instances from the unlabeled pool and request an oracle (e.g., a human expert) to provide labels for those samples. The goal of active learning is to infer the informativeness of unlabeled samples so as to minimize the number of requests to the oracle. Here, we formulate active learning as an open-set recognition problem. In this paradigm, only some of the inputs belong to known classes; the classifier must identify the rest as unknown. More specifically, we leverage variational neural networks (VNNs), which produce high-confidence (i.e., low-entropy) predictions only for inputs that closely resemble the training data. We use the inverse of this confidence measure to select the samples that the oracle should label. Intuitively, unlabeled samples that the VNN is uncertain about contain features that the network has not yet been exposed to; thus they are more informative for future training. We carried out an extensive evaluation of our novel, probabilistic formulation of active learning, achieving state-of-the-art results on MNIST, CIFAR-10, CIFAR-100, and FashionMNIST. Additionally, unlike current active learning methods, our algorithm can learn even in the presence of out-of-distribution outliers. As our experiments show, when the unlabeled pool consists of a mixture of samples from multiple datasets, our approach can automatically distinguish between samples from seen vs. unseen datasets. Overall, our results show that high-quality uncertainty measures are key for pool-based active learning.
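The acquisition rule described above can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes the model (a VNN in the paper, here any probabilistic classifier) yields per-class probabilities for each unlabeled sample, scores each sample by predictive entropy, and queries the oracle for the highest-entropy samples. The function names and the toy probabilities are hypothetical.

```python
import numpy as np

def predictive_entropy(probs):
    """Entropy (in nats) of each row of class probabilities.

    High entropy = low confidence = the sample likely contains
    features the network has not been exposed to.
    """
    eps = 1e-12  # avoid log(0)
    return -np.sum(probs * np.log(probs + eps), axis=1)

def select_queries(probs, k):
    """Indices of the k most uncertain samples to send to the oracle."""
    entropy = predictive_entropy(probs)
    return np.argsort(-entropy)[:k]

# Toy unlabeled pool: two confident predictions and one
# near-uniform (most uncertain) prediction over 3 classes.
pool_probs = np.array([
    [0.98, 0.01, 0.01],
    [0.10, 0.80, 0.10],
    [0.34, 0.33, 0.33],
])
print(select_queries(pool_probs, 2))  # near-uniform row ranks first: [2 1]
```

In the paper's setting, the probabilities would come from the VNN's posterior predictive distribution, so low entropy is only produced for inputs resembling the labeled training data.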
Highlights
Supervised deep learning has achieved remarkable results across a variety of domains by leveraging large, labeled datasets (LeCun et al., 2015)
As we show in our experiments, our open-set recognition (OSR)-based active learning (AL) method can automatically ignore samples that do not belong to the target dataset
Uncertainty sampling is suitable for active learning problems in which all unlabeled samples belong to known classes
Summary
Supervised deep learning has achieved remarkable results across a variety of domains by leveraging large, labeled datasets (LeCun et al., 2015). Our ability to collect data far outstrips our ability to label it, and this gap only continues to grow. The problem is especially stark in domains where acquiring the ground truth requires a highly trained specialist, e.g., medical imaging. There often exists a small subset of highly informative samples that can provide most of the information needed to learn to solve a task. In this case, we can achieve nearly the same performance while labeling only a fraction of the data.