Multi-Class Active Learning by Integrating Uncertainty and Diversity

Zengmao Wang,Chen Wu,Xi Fang,Xinyao Tang

doi:10.1109/access.2018.2817845

Abstract

Active learning is a promising way to reduce the labeling cost with a limited training samples initially, and then iteratively select the most valuable samples from a large number of unlabeled data for labeling in order to construct a powerful classifier. The goal of active learning is to make the labeled data set has no redundancy as much as possible. Uncertainty and diversity are two important criteria for active learning. Currently, a promising way by combining uncertainty and diversity for active learning is developed. However, many of these methods are designed based on the binary class or uncertainty followed by diversity strategy. They are hard to select the most valuable samples for multiple classes with binary setting with diversity and uncertainty simultaneously. In this paper, we integrate uncertainty and diversity into one formula by multi-class settings. Uncertainty is measured by the margin minimum while diversity is measured by the maximum mean discrepancy, which is popular to measure the distribution between two data sets. By minimizing the upper bound for the true risk of the integrating formula, we find the samples that not only uncertainty but also diversity with each other. We conduct our experiments on 12 benchmark UC Irvine data sets, and the experimental results demonstrate that the proposed method performs better than some other state-of-the-art methods.

Full Text