A Maximum Entropy Framework for Semisupervised and Active Learning With Unknown and Label-Scarce Classes.

Zhicong Qiu,George Kesidis,David J Miller

doi:10.1109/tnnls.2016.2514401

Abstract

We investigate semisupervised learning (SL) and pool-based active learning (AL) of a classifier for domains with label-scarce (LS) and unknown categories, i.e., defined categories for which there are initially no labeled examples. This scenario manifests, e.g., when a category is rare, or expensive to label. There are several learning issues when there are unknown categories: 1) it is a priori unknown which subset of (possibly many) measured features are needed to discriminate unknown from common classes and 2) label scarcity suggests that overtraining is a concern. Our classifier exploits the inductive bias that an unknown class consists of the subset of the unlabeled pool's samples that are atypical (relative to the common classes) with respect to certain key (albeit a priori unknown) features and feature interactions. Accordingly, we treat negative log- p -values on raw features as nonnegatively weighted derived feature inputs to our class posterior, with zero weights identifying irrelevant features. Through a hierarchical class posterior, our model accommodates multiple common classes, multiple LS classes, and unknown classes. For learning, we propose a novel semisupervised objective customized for the LS/unknown category scenarios. While several works minimize class decision uncertainty on unlabeled samples, we instead preserve this uncertainty [maximum entropy (maxEnt)] to avoid overtraining. Our experiments on a variety of UCI Machine learning (ML) domains show: 1) the use of p -value features coupled with weight constraints leads to sparse solutions and gives significant improvement over the use of raw features and 2) for LS SL and AL, unlabeled samples are helpful, and should be used to preserve decision uncertainty (maxEnt), rather than to minimize it, especially during the early stages of AL. Our AL system, leveraging a novel sample-selection scheme, discovers unknown classes and discriminates LS classes from common ones, with sparing use of oracle labeling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Maximum Entropy Framework for Semisupervised and Active Learning With Unknown and Label-Scarce Classes.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Jan 26, 2016
Citations: 29

Similar Papers

Dirichlet Process Based Active Learning and Discovery of Unknown Classes for Hyperspectral Image Classification
Hao Wu ... Saurabh Prasad
IEEE Transactions on Geoscience and Remote Sensing | VOL. 54
Hao Wu, et. al.Hao Wu ... Saurabh Prasad
01 Aug 2016
IEEE Transactions on Geoscience and Remote Sensing | VOL. 54

A new semi-supervised approach for hyperspectral image classification with different active learning strategies
Inmaculada Dopido ... Antonio Plaza
-
Inmaculada Dopido, et. al.Inmaculada Dopido ... Antonio Plaza
01 Jun 2012
01 Jun 2012

Employing unlabeled data to improve the classification performance of SVM, and its application in audio event classification
Yan Leng ... Dengwang Li
Knowledge-Based Systems | VOL. 98
Yan Leng, et. al.Yan Leng ... Dengwang Li
17 Feb 2016
Knowledge-Based Systems | VOL. 98

A Generic Semi-Supervised and Active Learning Framework for Biomedical Text Classification.
Christopher A Flores ... Rodrigo Verschae
Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference | VOL. 2022
Christopher A Flores, et. al.Christopher A Flores ... Rodrigo Verschae
11 Jul 2022
11 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Maximum Entropy Framework for Semisupervised and Active Learning With Unknown and Label-Scarce Classes.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems