Detection of Annotation Errors in Corpora

Markus Dickinson

doi:10.1111/lnc3.12129

Abstract

This paper surveys methods for annotation error detection and correction. Methods can broadly be characterized by whether they detect inconsistencies with respect to a statistical model based only on the corpus data or with respect to a grammatical model or, more generally, some external information source. Two extended examples illustrate these techniques: (1) the variation n‐gram method, which searches for inconsistencies in annotation for identical strings; and (2) a method of ad hoc rule detection for syntactic annotation, which compares treebank rules to a grammar to determine which are anomalous. Methods for detecting annotation errors have developed considerably over the last decade, so corpus practitioners can benefit greatly from them, while NLP researchers can learn more about the nuances of the annotation they use and see how error correction methods intersect with NLP techniques.
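
As a rough illustration of the first example, the idea of searching for inconsistent annotation of identical strings can be sketched as follows. This is a minimal sketch, not the paper's implementation: it assumes part-of-speech annotation, a fixed-size window of surrounding words (the `context` parameter is an assumption made here), and a hypothetical toy corpus. The function name `variation_nuclei` and the sentence-boundary markers are likewise illustrative.

```python
from collections import defaultdict


def variation_nuclei(tagged_sentences, context=1):
    """Return word windows whose middle word is tagged in more than one way.

    tagged_sentences: list of sentences, each a list of (word, tag) pairs.
    context: number of words of identical context required on each side
             (an assumption of this sketch).
    """
    seen = defaultdict(set)
    for sent in tagged_sentences:
        words = [w for w, _ in sent]
        # Pad so that sentence-initial and sentence-final words get a full window.
        padded = ["<s>"] * context + words + ["</s>"] * context
        for i, (_, tag) in enumerate(sent):
            # Word i of the sentence sits at position i + context in `padded`,
            # so this slice is centered on it.
            window = tuple(padded[i : i + 2 * context + 1])
            seen[window].add(tag)
    # Keep only windows where the same string received different tags.
    return {w: tags for w, tags in seen.items() if len(tags) > 1}


# Hypothetical toy corpus: "run" is tagged NN once and VB once in the same context.
corpus = [
    [("the", "DT"), ("run", "NN"), ("ended", "VBD")],
    [("the", "DT"), ("run", "VB"), ("ended", "VBD")],
    [("they", "PRP"), ("run", "VBP"), ("daily", "RB")],
]

flagged = variation_nuclei(corpus)
```

Here `flagged` contains the window `("the", "run", "ended")` with both tags assigned to "run". Variation of this kind is only a candidate error: a string can be genuinely ambiguous, so requiring more identical context is one way to trade recall for precision.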

Journal: Language and Linguistics Compass
Publication Date: Mar 1, 2015
Citations: 16