Human Annotation Effort Research Articles

Active learning can reduce the human effort required for labeling training samples while preserving the performance of visual classifiers. However, existing active learning frameworks cannot be used to perform visual classification of industrial product surface defects because they still require intensive manual annotation efforts. In this study, we propose a cost-efficient autolabeling-enhanced active learning (ALEAL) framework to reduce the human annotation effort required for surface defect visual classification. The proposed ALEAL framework employs a deep convolutional neural network (CNN) as a visual classifier trained from an initial set of human-labeled training samples. Then, the collected unlabeled training samples are input into the classifier for category confidence estimation. Next, a novel diverse cost-effective query strategy (DCEQS) is proposed to select some high-confidence samples for autolabeling and some informative samples for sample proposals that need labeling. Subsequently, to further reduce the human annotation effort, a novel autolabeling module is proposed and introduced in ALEAL that can automatically label a portion of the informative unlabeled training samples selected by the DCEQS. In this study, a novel attention-based similarity measurement network (ASMN) is proposed as an implementation of this autolabeling module by measuring the similarity between unlabeled and labeled samples. Finally, the remaining unlabeled samples are annotated by human experts, and all the newly labeled samples are used to retrain the classifier. Through the autolabeling process from the DCEQS and ASMN, ALEAL can automatically label additional training samples and achieve a competitive performance while requiring few human-labeled training samples, which is highly important in industrial applications. Extensive experimental results show that, compared with popular active learning methods, ALEAL can dramatically reduce the effort involved in human annotation and achieve state-of-the-art cost efficiency for the visual classification of industrial product surface defects.

Read full abstract

Knowledge graph is becoming an indispensable resource that offers structured information for numerous AI applications. However, the knowledge graph often suffers from its incompleteness. Building a complete, high-quality knowledge graph is time-consuming and requires significant human annotation efforts. In this paper, we study the Knowledge Graph Population task, which aims at extending the scale of structured knowledge, with a special focus on reducing data preparation and annotation efforts. Previous works mainly based on discriminative methods build classifiers and verify candidate triplets that are extracted from texts, which heavily rely on the quality of data collection and co-occurrance of entities in the text. However, such methods fail to generalize on entity pairs that are not highly co-occurred, and fail to discover entity pairs that are not co-occurred at all in the given text corpus. We introduce a generative perspective to approach this task and define each relationship by learning the data distribution that embodies the core common properties for relational reasoning. A generative model <sc xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">KGGen</small> is proposed, which samples from the learned data distribution for each relation and can generate triplets regardless of entity pair co-occurrence in the text corpus. To further improve the generation quality while alleviate human annotation efforts, adversarial learning is adopted to not only encourage generating high quality triplets, but also give model the ability to automatically assess the generation quality. Quantitative and qualitative experimental results conducted on two real-world generic knowledge graphs show that the proposed model <sc xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">KGGen</small> generates novel and meaningful triplets with improved efficiency and less human annotation comparing with the state-of-the-art approaches.

Read full abstract

Human Annotation Effort Research Articles

Articles published on Human Annotation Effort

Weakly Supervised Cell Segmentation by Point Annotation.

Autolabeling-Enhanced Active Learning for Cost-Efficient Surface Defect Visual Classification

Hierarchical Active Learning with Overlapping Regions.

KGGen: A Generative Approach for Incipient Knowledge Graph Population

Asking the Right Questions to the Right Users: Active Learning with Imperfect Oracles

Weakly-Supervised Semantic Segmentation With Regional Location Cutting and Dynamic Credible Regions Correction

An Active Deep Learning Approach for Minimally Supervised PolSAR Image Classification

Distribution-Based Semi-Supervised Learning for Activity Recognition

Region-Based Active Learning with Hierarchical and Adaptive Region Construction.

Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch

Hierarchical Active Learning with Proportion Feedback on Regions.

Integrating heterogeneous sources for predicting question temporal anchors across Yahoo! Answers

Hierarchical Active Learning with Group Proportion Feedback.

Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification

Multicriteria-Based Active Discriminative Dictionary Learning for Scene Recognition

Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning

Knowledge base population using semantic label propagation

Domain adaptation for microscopy imaging.

Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes.

Combining Active and Ensemble Learning for Efficient Classification of Web Documents

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Human Annotation Effort Research Articles

Articles published on Human Annotation Effort

Weakly Supervised Cell Segmentation by Point Annotation.

Autolabeling-Enhanced Active Learning for Cost-Efficient Surface Defect Visual Classification

Hierarchical Active Learning with Overlapping Regions.

KGGen: A Generative Approach for Incipient Knowledge Graph Population

Asking the Right Questions to the Right Users: Active Learning with Imperfect Oracles

Weakly-Supervised Semantic Segmentation With Regional Location Cutting and Dynamic Credible Regions Correction

An Active Deep Learning Approach for Minimally Supervised PolSAR Image Classification

Distribution-Based Semi-Supervised Learning for Activity Recognition

Region-Based Active Learning with Hierarchical and Adaptive Region Construction.

Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch

Hierarchical Active Learning with Proportion Feedback on Regions.

Integrating heterogeneous sources for predicting question temporal anchors across Yahoo! Answers

Hierarchical Active Learning with Group Proportion Feedback.

Adaptive Feature Selection and Feature Fusion for Semi-supervised Classification

Multicriteria-Based Active Discriminative Dictionary Learning for Scene Recognition

Multilingual Metaphor Processing: Experiments with Semi-Supervised and Unsupervised Learning

Knowledge base population using semantic label propagation

Domain adaptation for microscopy imaging.

Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes.

Combining Active and Ensemble Learning for Efficient Classification of Web Documents