All that glitters...

Lars Borin

doi:10.7557/12.6348

All that glitters...

Lars Borin

Open Access

https://doi.org/10.7557/12.6348

Copy DOI

Journal: Nordlyd	Publication Date: Aug 30, 2022
Citations: 1	License type: CC BY-NC 4.0

#Gold Standard #Natural Language Processing + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Evaluation has emerged as a central concern in natural language processing (NLP) over the last few decades. Evaluation is done against a gold standard, a manually linguistically annotated dataset, which is assumed to provide the ground truth against which the accuracy of the NLP system can be assessed automatically. In this article, some methodological questions in connection with the creation of gold standard datasets are discussed, in particular (non-)expectations of linguistic expertise in annotators and the interannotator agreement measure standardly but unreflectedly used as a kind of quality index of NLP gold standards.

Full Text