On Robustness and Sensitivity of a Neural Language Model: A Case Study on Italian L1 Learner Errors

Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi, Dominique Brunato

doi:10.1109/taslp.2022.3226333

Abstract

In this paper, we propose a comprehensive linguistic study aimed at assessing the implicit behavior of one of the most prominent Transformer-based Neural Language Models (NLMs), BERT (Devlin et al.), when dealing with a particular source of noisy data, namely essays written by L1 Italian learners that contain a variety of grammatical, orthographic and lexical errors. Unlike previous work, we focus on the pre-training stage and devise two complementary evaluation tasks that assess the impact of errors on sentence-level inner representations in terms of semantic robustness and linguistic sensitivity. The first task probes the model's ability to encode the semantic similarity between sentences even in the presence of errors; the second evaluates the influence of errors on BERT's implicit knowledge of a set of raw and morpho-syntactic properties of a sentence. Our experiments show that BERT's ability to compute sentence similarity and its ability to correctly encode multi-level linguistic information about a sentence are modulated differently by the category of error, and that the error hierarchies in terms of robustness and sensitivity change across layer-wise representations.
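
As a rough illustration of the robustness perspective described in the abstract, the sketch below compares a learner sentence with its corrected version layer by layer. It is not the authors' implementation: the mean pooling over tokens, the cosine measure, the function names (`mean_pool`, `cosine`, `layerwise_similarity`) and the toy activation values are all assumptions made for this example. In a real setting the per-layer token vectors would come from a pre-trained BERT model.

```python
import math


def mean_pool(token_vectors):
    """Average equal-length token vectors into one sentence vector (assumed pooling)."""
    n = len(token_vectors)
    dim = len(token_vectors[0])
    return [sum(tok[i] for tok in token_vectors) / n for i in range(dim)]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of the same length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def layerwise_similarity(learner_layers, corrected_layers):
    """Return one similarity score per layer for a learner/corrected sentence pair."""
    return [
        cosine(mean_pool(learner), mean_pool(corrected))
        for learner, corrected in zip(learner_layers, corrected_layers)
    ]


# Hypothetical activations: 2 layers, 2 tokens per sentence, 3 dimensions.
learner_layers = [
    [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    [[1.0, 1.0, 0.0], [1.0, 1.0, 0.0]],
]
corrected_layers = [
    [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]],
    [[1.0, 1.0, 0.0], [2.0, 2.0, 0.0]],
]

scores = layerwise_similarity(learner_layers, corrected_layers)
for layer, score in enumerate(scores):
    print(f"layer {layer}: similarity {score:.3f}")
```

Under these assumptions, a layer whose score stays close to 1 for a given error category would count as robust to that category, and comparing scores across layers gives a layer-wise picture of the kind the abstract refers to.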


Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publication Date: Jan 1, 2023
Citations: 1
License type: cc-by


Similar Papers

Overcoming linguistic barriers in code assistants: creating a QLoRA adapter to improve support for Russian-language code writing instructions
C.B Pronin ... Yu.N Strogov
Dynamics of Complex Systems - XXI century
01 Jan 2024

Transformers4Rec: Bridging the Gap between NLP and Sequential / Session-Based Recommendation
Gabriel De Souza Pereira Moreira ... Ronay Ak
13 Sep 2021

A Social-aware Gaussian Pre-trained model for effective cold-start recommendation
Siwei Liu ... Iadh Ounis
Information Processing & Management | VOL. 61
14 Dec 2023

General Words Representation Method for Modern Language Model
Abbas Saliimi Lokman ... Mohamed Ariff Ameedeen
Journal of Telecommunication, Electronic and Computer Engineering (JTEC) | VOL. 15
29 Mar 2023
