Hitting the target: stopping active learning at the cost-based optimum

Zac Pullar-Strecker,Katharina Dost,Eibe Frank,Jörg Wicker

doi:10.1007/s10994-022-06253-1

Abstract

Active learning allows machine learning models to be trained using fewer labels while retaining similar performance to traditional supervised learning. An active learner selects the most informative data points, requests their labels, and retrains itself. While this approach is promising, it raises the question of how to determine when the model is ‘good enough’ without the additional labels required for traditional evaluation. Previously, different stopping criteria have been proposed aiming to identify the optimal stopping point. Yet, optimality can only be expressed as a domain-dependent trade-off between accuracy and the number of labels, and no criterion is superior in all applications. As a further complication, a comparison of criteria for a particular real-world application would require practitioners to collect additional labelled data they are aiming to avoid by using active learning in the first place. This work enables practitioners to employ active learning by providing actionable recommendations for which stopping criteria are best for a given real-world scenario. We contribute the first large-scale comparison of stopping criteria for pool-based active learning, using a cost measure to quantify the accuracy/label trade-off, public implementations of all stopping criteria we evaluate, and an open-source framework for evaluating stopping criteria. Our research enables practitioners to substantially reduce labelling costs by utilizing the stopping criterion which best suits their domain.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning	Publication Date: Oct 14, 2022
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Hitting the target: stopping active learning at the cost-based optimum

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Similar Papers

Confidence-based stopping criteria for active learning for data annotation
Jingbo Zhu ... Matthew Ma
ACM Transactions on Speech and Language Processing | VOL. 6
Jingbo Zhu, et. al.Jingbo Zhu ... Matthew Ma
01 Apr 2010
ACM Transactions on Speech and Language Processing | VOL. 6

Stopping Criterion for Active Learning with Model Stability
Yexun Zhang ... Ya Zhang
ACM Transactions on Intelligent Systems and Technology | VOL. 9
Yexun Zhang, et. al.Yexun Zhang ... Ya Zhang
25 Oct 2017
ACM Transactions on Intelligent Systems and Technology | VOL. 9

Stability-Based Stopping Criterion for Active Learning
Wenquan Wang ... Ya Zhang
-
Wenquan Wang, et. al.Wenquan Wang ... Ya Zhang
01 Dec 2014
01 Dec 2014

Stopping criteria for active learning of named entity recognition
Florian Laws ... Hinrich Schätze
-
Florian Laws, et. al.Florian Laws ... Hinrich Schätze
01 Jan 2008
01 Jan 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hitting the target: stopping active learning at the cost-based optimum

Abstract

Talk to us

Similar Papers

More From: Machine Learning