Abstract

The success of NLP research is founded on high-quality annotated datasets, which are usually obtained from multiple expert annotators or crowd workers. The standard practice for training machine learning models is to first adjudicate the disagreements and then perform the training. To this end, there has been a lot of work on aggregating annotations, particularly for classification tasks. However, many other tasks, particularly in NLP, have unique characteristics not considered by standard models of annotation, e.g., label interdependencies in sequence labelling tasks, unrestricted labels for anaphoric annotation, or preference labels for ranking texts. In recent years, researchers have picked up on this and are closing the gap. A first objective of this tutorial is to connect NLP researchers with state-of-the-art aggregation models for a diverse set of canonical language annotation tasks. There is also a growing body of recent work arguing that following the convention and training with adjudicated labels ignores any uncertainty the labellers had in their classifications, which results in models with poorer generalisation capabilities. Therefore, a second objective of this tutorial is to teach NLP researchers how they can augment their (deep) neural models to learn from data with multiple interpretations.
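To make the classification setting concrete, the sketch below is a minimal, hypothetical illustration (not taken from the tutorial materials) of two aggregation strategies: majority voting, the usual adjudication baseline, and a simplified Dawid-Skene-style EM that weights annotators by an estimated accuracy. The toy items, annotators, and labels are invented for the example.

```python
from collections import Counter

# Toy data (invented for illustration): annotations[item][annotator] = label
annotations = {
    "sent1": {"ann1": "POS", "ann2": "POS", "ann3": "NEG"},
    "sent2": {"ann1": "NEG", "ann2": "NEG", "ann3": "NEG"},
    "sent3": {"ann1": "POS", "ann2": "NEG", "ann3": "NEG"},
}
LABELS = ["POS", "NEG"]

def majority_vote(annotations):
    """Adjudicate each item with its most frequent label (the usual baseline)."""
    return {item: Counter(votes.values()).most_common(1)[0][0]
            for item, votes in annotations.items()}

def em_aggregate(annotations, labels, iters=20):
    """Simplified Dawid-Skene-style EM: one accuracy per annotator
    instead of a full confusion matrix."""
    # Initialise the posterior over true labels with raw vote proportions.
    post = {item: {l: sum(v == l for v in votes.values()) / len(votes)
                   for l in labels}
            for item, votes in annotations.items()}
    annotators = {a for votes in annotations.values() for a in votes}
    acc = {a: 0.8 for a in annotators}
    for _ in range(iters):
        # M-step: accuracy = expected fraction of an annotator's labels
        # that match the current soft "true" labels.
        for a in annotators:
            num = den = 0.0
            for item, votes in annotations.items():
                if a in votes:
                    num += post[item][votes[a]]
                    den += 1
            acc[a] = min(max(num / den, 1e-3), 1 - 1e-3)
        # E-step: posterior over true labels given annotator accuracies.
        for item, votes in annotations.items():
            scores = {}
            for l in labels:
                p = 1.0
                for a, v in votes.items():
                    p *= acc[a] if v == l else (1 - acc[a]) / (len(labels) - 1)
                scores[l] = p
            z = sum(scores.values())
            post[item] = {l: scores[l] / z for l in labels}
    return post

print(majority_vote(annotations))        # hard, adjudicated labels
print(em_aggregate(annotations, LABELS)) # soft labels that retain uncertainty
```

Unlike the majority vote, the EM variant keeps a distribution over labels for each item, which is also the kind of soft target that the learning-from-disagreement approaches highlighted below can exploit.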

Highlights

  • Introduction to the field
  • Shortcomings of early practices

  • Including the coders’ disagreements in the learning signal offers the models a richer source of information than adjudicated labels: the disagreements capture the consensus, but can also indicate ambiguity and how humans make mistakes. This improves the generalisation capability of the models and lets them degrade more gracefully, making fewer egregious mistakes (Peterson et al., 2019; Guan et al., 2018). Some of these approaches can also be used for noise distillation, as their learning processes produce aggregated labels that leverage both coder annotation patterns and the knowledge of the task accumulated by the model (Cao et al., 2018; Rodrigues and Pereira, 2018; Albarqouni et al., 2016; Chu et al., 2020).

  • We show how to reformulate NLP tasks with ambiguous categories or scores as preference learning, with example applications related to argument persuasiveness (a minimal sketch follows this list).
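As a rough sketch of the preference-learning reformulation in the last highlight, the snippet below fits a Bradley-Terry model to pairwise persuasiveness judgements, i.e., it learns a latent score per argument such that P(i preferred over j) = sigmoid(score_i − score_j). The argument names and preference pairs are invented, and this is just one simple instance of preference learning, not necessarily the models presented in the tutorial.

```python
import math

# Invented pairwise judgements: (winner, loser) means an annotator found the
# first argument more persuasive than the second. Note the disagreement on
# the arg_b vs arg_c pair.
prefs = [("arg_a", "arg_b"), ("arg_a", "arg_c"),
         ("arg_b", "arg_c"), ("arg_c", "arg_b")]

def bradley_terry(prefs, lr=0.1, iters=500):
    """Fit latent persuasiveness scores by gradient ascent on the
    Bradley-Terry log-likelihood."""
    items = {x for pair in prefs for x in pair}
    score = {x: 0.0 for x in items}
    for _ in range(iters):
        grad = {x: 0.0 for x in items}
        for w, l in prefs:
            p_win = 1.0 / (1.0 + math.exp(score[l] - score[w]))
            grad[w] += 1.0 - p_win   # push the preferred argument up
            grad[l] -= 1.0 - p_win   # push the other argument down
        for x in items:
            score[x] += lr * grad[x]
    return score

# Rank arguments by inferred persuasiveness.
print(sorted(bradley_terry(prefs).items(), key=lambda kv: -kv[1]))
```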


Summary

Description

The disagreement between annotators stems from ambiguous or subjective annotation tasks as well as annotator errors. Rather than adjudicating these disagreements away, recent approaches include them in the learning signal, which improves the generalisation capability of the models and lets them degrade more gracefully, making fewer egregious mistakes (Peterson et al., 2019; Guan et al., 2018). Some of these approaches can also be used for noise distillation, as their learning processes produce aggregated labels that leverage both coder annotation patterns and the knowledge of the task accumulated by the model (Cao et al., 2018; Rodrigues and Pereira, 2018; Albarqouni et al., 2016; Chu et al., 2020). A second objective of the tutorial is to teach NLP researchers how they can augment their existing (deep) neural architectures to learn from data with disagreements; a minimal sketch of this idea follows.
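As a minimal sketch of this second objective (assuming a PyTorch setup; the vote counts, features, and model are placeholders rather than the tutorial's code), the snippet below trains a classifier head against the full distribution of annotator votes, in the spirit of Peterson et al. (2019), instead of a single adjudicated label.

```python
import torch
import torch.nn as nn

# Per-item annotator vote counts over two classes (placeholder values).
vote_counts = torch.tensor([[3., 1.],   # mild disagreement
                            [0., 4.],   # unanimous
                            [2., 2.]])  # genuinely ambiguous item
soft_targets = vote_counts / vote_counts.sum(dim=1, keepdim=True)

features = torch.randn(3, 16)           # stand-in for encoded texts
model = nn.Linear(16, 2)                # stand-in for a real classifier head
optim = torch.optim.Adam(model.parameters(), lr=1e-2)

for step in range(200):
    log_probs = model(features).log_softmax(dim=1)
    # Cross-entropy against the annotation distribution, not a one-hot label.
    loss = -(soft_targets * log_probs).sum(dim=1).mean()
    optim.zero_grad()
    loss.backward()
    optim.step()
```

The only change relative to standard training is the target: the model sees how uncertain the annotators were about each item, rather than a single hard label.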

Learning outcomes
Part 1. Motivation and Early Approaches to Annotation Analysis
Part 2. Advanced Models of Annotation
Part 3. Learning with Multiple Annotators
Part 4. Practical Session
Audience prerequisites
Presenters