Overview of ADoBo 2021:: Automatic Detection of Unassimilated Borrowings in the Spanish Press

Elena Álvarez Mellado ,Julio Gonzalo Arroyo ,Luis Espinosa-Anke ,Constantine Lignos ,Jordi Porta

doi:10.26342/2021-67-24

Abstract

espanolEn este articulo presentamos los resultados de ADoBo 2021, la tarea compartida de IberLEF 2021 sobre deteccion de prestamos lexicos en la prensa espanola. En esta tarea abordamos la deteccion de prestamos como un problema de etiquetado de secuencias. A los participantes de la tarea se les proporciono un corpus de prensa espanola anotado con prestamos lexicos no asimilados (mayoritariamente anglicismos) siguiendo el esquema BIO. Recibimos nueve sistemas distintos provenientes de cuatro equipos diferentes. Los resultados obtenidos oscilan entre los 37 y los 85 puntos de valor F1, lo que indica que la deteccion de prestamos lexicos es un problema no resuelto (sobre todo cuando se abordan prestamos no vistos anteriormente) y que el trabajo lexicografico tradicional podria beneficiarse de incorporar las tecnicas actuales del PLN. EnglishThis paper summarizes the main findings of the ADoBo 2021 shared task, proposed in the context of IberLef 2021. In this task, we invited participants to detect lexical borrowings (coming mostly from English) in Spanish newswire texts. This task was framed as a sequence classification problem using BIO encoding. We provided participants with an annotated corpus of lexical borrowings which we split into training, development and test splits. We received submissions from 4 teams with 9 different system runs overall. The results, which range from F1 scores of 37 to 85, suggest that this is a challenging task, especially when out-of-domain or OOV words are considered, and that traditional methods informed with lexicographic information would benefit from taking advantage of current NLP trends.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Overview of ADoBo 2021:: Automatic Detection of Unassimilated Borrowings in the Spanish Press

Abstract

Talk to us

Similar Papers

More From: Procesamiento Del Lenguaje Natural

Lead the way for us

Similar Papers

Procesamiento de Expresiones Multipalabra en gallego mediante Aprendizaje Profundo
...
Procesamiento Del Lenguaje Natural | VOL. 67
, et. al. ...
06 Sep 2021
Procesamiento Del Lenguaje Natural | VOL. 67

Overview of the EmoEvalEs task on emotion detection for Spanish at IberLEF 2021
...
Procesamiento Del Lenguaje Natural | VOL. 67
, et. al. ...
06 Sep 2021
Overview of the EmoEvalEs task on emotion detection for Spanish at IberLEF 2021
...

AutoPunct: A BERT-based Automatic Punctuation and Capitalisation System for Spanish and Basque
...
Procesamiento Del Lenguaje Natural | VOL. 67
, et. al. ...
06 Sep 2021
Procesamiento Del Lenguaje Natural | VOL. 67

Overview of Rest-Mex at IberLEF 2021: Recommendation System for Text Mexican Tourism
...
Procesamiento Del Lenguaje Natural | VOL. 67
, et. al. ...
06 Sep 2021
Overview of Rest-Mex at IberLEF 2021: Recommendation System for Text Mexican Tourism
...

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Overview of ADoBo 2021:: Automatic Detection of Unassimilated Borrowings in the Spanish Press

Abstract

Talk to us

Similar Papers

More From: Procesamiento Del Lenguaje Natural