Label Noise Research Articles

The count of mitotic figures (MFs) observed in hematoxylin and eosin (H&E)-stained slides is an important prognostic marker, as it is a measure for tumor cell proliferation. However, the identification of MFs has a known low inter-rater agreement. In a computer-aided setting, deep learning algorithms can help to mitigate this, but they require large amounts of annotated data for training and validation. Furthermore, label noise introduced during the annotation process may impede the algorithms’ performance. Unlike H&E, where identification of MFs is based mainly on morphological features, the mitosis-specific antibody phospho-histone H3 (PHH3) specifically highlights MFs. Counting MFs on slides stained against PHH3 leads to higher agreement among raters and has therefore recently been used as a ground truth for the annotation of MFs in H&E. However, as PHH3 facilitates the recognition of cells indistinguishable from H&E staining alone, the use of this ground truth could potentially introduce an interpretation shift and even label noise into the H&E-related dataset, impacting model performance. This study analyzes the impact of PHH3-assisted MF annotation on inter-rater reliability and object level agreement through an extensive multi-rater experiment. Subsequently, MF detectors, including a novel dual-stain detector, were evaluated on the resulting datasets to investigate the influence of PHH3-assisted labeling on the models’ performance. We found that the annotators’ object-level agreement significantly increased when using PHH3-assisted labeling (F1: 0.53 to 0.74). However, this enhancement in label consistency did not translate to improved performance for H&E-based detectors, neither during the training phase nor the evaluation phase. Conversely, the dual-stain detector was able to benefit from the higher consistency. This reveals an information mismatch between the H&E and PHH3-stained images as the cause of this effect, which renders PHH3-assisted annotations not well-aligned for use with H&E-based detectors. Based on our findings, we propose an improved PHH3-assisted labeling procedure.

Read full abstract

Emotion classification using electroencephalographic (EEG) data is a challenging task in the field of Artificial Intelligence. While many researchers have focused on finding the best model or feature extraction technique to achieve optimal results, few have attempted to select the best methodological steps for working with the dataset. In this study, we applied two different theoretical approaches based on the noise of the dataset: curriculum learning and confident learning. Curriculum learning involves presenting training examples to the model in a specific order, starting with easier examples and gradually increasing in difficulty. This approach has been shown to improve model performance. Confident learning is a method for identifying and correcting label errors in datasets. By identifying and correcting these errors, confident learning can improve the performance of machine learning models trained on noisy datasets. We then applied the Integrated Gradient technique in order to assess the explainability of each model. Our aim was to explore the impact of different models and methods on emotion classification performance using EEG data. We collected and used an EEG dataset in which participants rated the emotional valence of positive and negative pictures while performing an emotion regulation (ER) task, comparing a control condition (Look) with two ER strategies: cognitive reappraisal and expressive suppression. We performed a multilabel classification to identify emotional neutrality or polarization of emotional valence (both positive and negative) rated by participants and the emotion regulation strategy adopted during the task. We compared the performance of models trained on three datasets selected based on label noise and evaluated their suitability for this task. Our results suggest different patterns based on the architecture used for feature importance, highlighting both advantages and criticisms.

Read full abstract

Label Noise Research Articles

Related Topics

Articles published on Label Noise

Reverse stable diffusion: What prompt was used to generate this image?

Enhanced Plant Leaf Classification over a Large Number of Classes Using Machine Learning

SPORT: A Subgraph Perspective on Graph Classification with Label Noise

Separating Noisy Samples From Tail Classes for Long-Tailed Image Classification With Label Noise.

Information mismatch in PHH3-assisted mitosis annotation leads to interpretation shifts in H&E slide analysis

Efficient ensemble uncertainty estimation in Gaussian processes regression

Sample feature enhancement model based on heterogeneous graph representation learning for few-shot relation classification

Domain adaptation for semantic segmentation of road scenes via two-stage alignment of traffic elements

Multi-instance learning in the presence of positive and unlabeled bags

EEG-based emotional valence and emotion regulation classification: a data-centric and explainable approach

Mises-Fisher similarity-based boosted additive angular margin loss for breast cancer classification

Sound quality characteristics of reciprocating hermetic refrigeration compressors

Beyond clean data: Exploring the effects of label noise on object detection performance

Pseudo-label refinement via hierarchical contrastive learning for source-free unsupervised domain adaptation

Radiomic Prediction of Parkinson’s Disease Deep Brain Stimulation Surgery Outcomes using Quantitative Susceptibility Mapping and Label Noise Compensation

A New Joint Training Method for Facial Expression Recognition with Inconsistently Annotated and Imbalanced Data

Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos.

A performance-based mental workload identification method for special vehicle crews

Critical insights into data curation and label noise for accurate prediction of aerobic biodegradability of organic chemicals.

Intuitionistic fuzzy least squares MLTSVM for noisy label data using label-specific features and local label correlation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Label Noise Research Articles

Related Topics

Articles published on Label Noise

Reverse stable diffusion: What prompt was used to generate this image?

Enhanced Plant Leaf Classification over a Large Number of Classes Using Machine Learning

SPORT: A Subgraph Perspective on Graph Classification with Label Noise

Separating Noisy Samples From Tail Classes for Long-Tailed Image Classification With Label Noise.

Information mismatch in PHH3-assisted mitosis annotation leads to interpretation shifts in H&E slide analysis

Efficient ensemble uncertainty estimation in Gaussian processes regression

Sample feature enhancement model based on heterogeneous graph representation learning for few-shot relation classification

Domain adaptation for semantic segmentation of road scenes via two-stage alignment of traffic elements

Multi-instance learning in the presence of positive and unlabeled bags

EEG-based emotional valence and emotion regulation classification: a data-centric and explainable approach

Mises-Fisher similarity-based boosted additive angular margin loss for breast cancer classification

Sound quality characteristics of reciprocating hermetic refrigeration compressors

Beyond clean data: Exploring the effects of label noise on object detection performance

Pseudo-label refinement via hierarchical contrastive learning for source-free unsupervised domain adaptation

Radiomic Prediction of Parkinson’s Disease Deep Brain Stimulation Surgery Outcomes using Quantitative Susceptibility Mapping and Label Noise Compensation

A New Joint Training Method for Facial Expression Recognition with Inconsistently Annotated and Imbalanced Data

Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos.

A performance-based mental workload identification method for special vehicle crews

Critical insights into data curation and label noise for accurate prediction of aerobic biodegradability of organic chemicals.

Intuitionistic fuzzy least squares MLTSVM for noisy label data using label-specific features and local label correlation