Iterative Human-in-the-Loop Discovery of Unknown Unknowns in Image Datasets

Lei Han, Xiao Dong, Gianluca Demartini

doi:10.1609/hcomp.v9i1.18941

Abstract

Automatic predictions (e.g., recognizing objects in images) may result in systematic errors if certain classes are not well represented by training instances; these errors are called unknowns. When a model assigns high confidence scores to such wrong predictions (errors known as unknown unknowns, or UUs), they become challenging to identify automatically. In this paper, we present the first work on leveraging human intelligence to discover UUs iteratively. The proposed methodology first differentiates the feature space generated by crowd workers labelling instances (e.g., images) in an active learning fashion from the space learned by the prediction model during a batch training phase, and thereby identifies the predictions most likely to be UUs. Next, we add the crowd labels collected for these discovered UUs to the training set and re-train the model on the extended dataset. This process is repeated iteratively to discover more instances of both unknown and under-represented classes. Our experimental results show that the proposed methodology is able to (1) efficiently discover UUs, (2) significantly improve the quality of model predictions, and (3) push UUs into known unknowns (i.e., the model still makes mistakes, but its classification confidence on those instances is low, so those predictions can be discarded or post-processed) for further investigation. We additionally discuss the trade-off between prediction quality improvements and the human effort required to achieve them. Our results have implications for building cost-effective systems that discover UUs with humans in the loop.
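
The iterative loop described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The nearest-centroid classifier, the distance-based ranking heuristic (a simple stand-in for comparing the crowd-derived and model-learned feature spaces), the `ask_crowd` oracle, the toy 2-D points, and all names and parameters (`discover_uus`, `budget`, `threshold`) are hypothetical.

```python
import math


class NearestCentroid:
    """Toy classifier (an assumption): predicts the label of the closest class centroid."""

    def fit(self, X, y):
        by_label = {}
        for x, label in zip(X, y):
            by_label.setdefault(label, []).append(x)
        self.centroids = {
            label: tuple(sum(p[i] for p in pts) / len(pts) for i in range(len(pts[0])))
            for label, pts in by_label.items()
        }
        return self

    def predict_with_confidence(self, x):
        # Softmax over negative distances serves as a confidence score.
        weights = {lbl: math.exp(-math.dist(x, c)) for lbl, c in self.centroids.items()}
        best = max(weights, key=weights.get)
        return best, weights[best] / sum(weights.values())


def discover_uus(train_X, train_y, pool, ask_crowd, rounds=2, budget=2, threshold=0.9):
    """Hypothetical loop: query confident predictions, keep the wrong ones, re-train."""
    train_X, train_y, pool = list(train_X), list(train_y), list(pool)
    model = NearestCentroid().fit(train_X, train_y)
    history = []
    for _ in range(rounds):
        # Only high-confidence predictions can be unknown unknowns.
        confident = []
        for x in pool:
            label, conf = model.predict_with_confidence(x)
            if conf >= threshold:
                confident.append((x, label))
        # Assumed heuristic: instances far from all training data are checked first.
        confident.sort(
            key=lambda item: min(math.dist(item[0], t) for t in train_X),
            reverse=True,
        )
        found = []
        for x, predicted in confident[:budget]:
            pool.remove(x)
            crowd_label = ask_crowd(x)  # simulated crowd worker
            if crowd_label != predicted:  # confident but wrong: a discovered UU
                found.append(x)
                train_X.append(x)
                train_y.append(crowd_label)
        history.append(found)
        # Re-train on the training set extended with the crowd labels.
        model = NearestCentroid().fit(train_X, train_y)
    return model, history


# Toy data: "fox" never appears in the initial training set.
train_X = [(0, 0), (1, 0), (0, 1), (10, 0), (11, 0), (10, 1)]
train_y = ["cat", "cat", "cat", "dog", "dog", "dog"]
truth = {
    (0.5, 0.5): "cat", (10.5, 0.5): "dog", (5.3, 0.3): "dog",
    (0, 4): "fox", (1, 5): "fox", (0, 6): "fox",
}
model, history = discover_uus(train_X, train_y, list(truth), truth.__getitem__)
print(history)
```

In this toy run the `budget` parameter caps the number of crowd queries per round, which is one simple way to picture the trade-off between human effort and prediction quality that the abstract mentions.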

Journal: Proceedings of the AAAI Conference on Human Computation and Crowdsourcing
Publication Date: Oct 4, 2021
Citations: 2
