LexFit: Lexical Fine-Tuning of Pretrained Language Models

Anna Korhonen ,Ivan Vulić ,Goran Glavaš ,Edoardo Ponti

doi:10.48448/2skf-gv34

Abstract

Transformer-based language models (LMs) pretrained on large text collections implicitly store a wealth of lexical semantic knowledge, but it is non-trivial to extract that knowledge effectively from their parameters. Inspired by prior work on semantic specialization of static word embedding (WE) models, we show that it is possible to expose and enrich lexical knowledge from the LMs, that is, to specialize them to serve as effective and universal "decontextualized" word encoders even when fed input words "in isolation" (i.e., without any context). Their transformation into such word encoders is achieved through a simple and efficient lexical fine-tuning procedure (termed LexFit) based on dual-encoder network structures. Further, we show that LexFit can yield effective word encoders even with limited lexical supervision and, via cross-lingual transfer, in different languages without any readily available external knowledge. Our evaluation over four established, structurally different lexical-level tasks in 8 languages indicates the superiority of LexFit-based WEs over standard static WEs (e.g., fastText) and WEs from vanilla LMs. Other extensive experiments and ablation studies further profile the LexFit framework, and indicate best practices and performance variations across LexFit variants, languages, and lexical tasks, also directly questioning the usefulness of traditional WE models in the era of large neural models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LexFit: Lexical Fine-Tuning of Pretrained Language Models

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An empirical assessment of different word embedding and deep learning models for bug assignment
Rongcun Wang ... Rubing Huang
The Journal of Systems & Software | VOL. 210
Rongcun Wang, et. al.Rongcun Wang ... Rubing Huang
06 Jan 2024
The Journal of Systems & Software | VOL. 210

A reproducible survey on word embeddings and ontology-based methods for word similarity: Linear combinations outperform the state of the art
Juan J Lastra-Díaz ... Eneko Agirre
Engineering Applications of Artificial Intelligence | VOL. 85
Juan J Lastra-Díaz, et. al.Juan J Lastra-Díaz ... Eneko Agirre
01 Aug 2019
Engineering Applications of Artificial Intelligence | VOL. 85

Specializing Distributional Vectors of All Words for Lexical Entailment
Aishwarya Kamath ... Edoardo Maria Ponti
-
Aishwarya Kamath, et. al.Aishwarya Kamath ... Edoardo Maria Ponti
01 Jan 2019
01 Jan 2019

An efficient multiple-word embedding-based cross-domain feature extraction and aspect sentiment classification
Monika Agrawal ... Nageswara Rao Moparthi
Measurement: Sensors | VOL. 28
Monika Agrawal, et. al.Monika Agrawal ... Nageswara Rao Moparthi
26 Jun 2023
Measurement: Sensors | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LexFit: Lexical Fine-Tuning of Pretrained Language Models

Abstract

Talk to us

Similar Papers