Topic aware probing: From sentence length prediction to idiom identification how reliant are neural language models on topic?

Vasudevan Nedumpozhimana, John D Kelleher

doi:10.1017/nlp.2024.43

Abstract

Transformer-based neural language models achieve state-of-the-art performance on various natural language processing tasks. However, an open question is the extent to which these models rely on word-order/syntactic or word co-occurrence/topic-based information when processing natural language. This work contributes to this debate by addressing the question of whether these models primarily use topic as a signal, by exploring the relationship between Transformer-based models’ (BERT and RoBERTa’s) performance on a range of probing tasks in English, from simple lexical tasks such as sentence length prediction to complex semantic tasks such as idiom token identification, and the sensitivity of these tasks to the topic information. To this end, we propose a novel probing method which we call topic-aware probing. Our initial results indicate that Transformer-based models encode both topic and non-topic information in their intermediate layers, but also that the facility of these models to distinguish idiomatic usage is primarily based on their ability to identify and encode topic. Furthermore, our analysis of these models’ performance on other standard probing tasks suggests that tasks that are relatively insensitive to the topic information are also tasks that are relatively difficult for these models.

Journal: Natural Language Processing
Publication Date: Oct 25, 2024
License type: CC BY 4.0

