Efficient Structured Prediction with Transformer Encoders

Ali Basirat

doi:10.3384/nejlt.2000-1533.2024.4932

Abstract

Finetuning is a useful method for adapting Transformer-based text encoders to new tasks but can be computationally expensive for structured prediction tasks that require tuning at the token level. Furthermore, finetuning is inherently inefficient in updating all base model parameters, which prevents parameter sharing across tasks. To address these issues, we propose a method for efficient task adaptation of frozen Transformer encoders based on the local contribution of their intermediate layers to token representations. Our adapter uses a novel attention mechanism to aggregate intermediate layers and tailor the resulting representations to a target task. Experiments on several structured prediction tasks demonstrate that our method outperforms previous approaches, retaining over 99% of the finetuning performance at a fraction of the training cost. Our proposed method offers an efficient solution for adapting frozen Transformer encoders to new tasks, improving performance and enabling parameter sharing across different tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Efficient Structured Prediction with Transformer Encoders

Abstract

Talk to us

Similar Papers

More From: Northern European Journal of Language Technology

Lead the way for us

Journal: Northern European Journal of Language Technology	Publication Date: Mar 14, 2024
License type: CC BY 4.0

Similar Papers

Adversarial Structure Matching for Structured Prediction Tasks
Jyh-Jing Hwang ... Stella X Yu
-
Jyh-Jing Hwang, et. al.Jyh-Jing Hwang ... Stella X Yu
01 Jun 2019
01 Jun 2019

Hands-on Learning to Search for Structured Prediction
Hal Daumé Iii ... John Langford
-
Hal Daumé Iii, et. al.Hal Daumé Iii ... John Langford
01 Jan 2015
01 Jan 2015

Adversarial Attack and Defense of Structured Prediction Models
Wenjuan Han ... Liwen Zhang
-
Wenjuan Han, et. al.Wenjuan Han ... Liwen Zhang
01 Jan 2020
01 Jan 2020

Multi-View Cross-Lingual Structured Prediction with Minimum Supervision
...
-
, et. al. ...
01 Aug 2021
01 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Structured Prediction with Transformer Encoders

Abstract

Talk to us

Similar Papers

More From: Northern European Journal of Language Technology