Robust Transfer Learning with Pretrained Language Models through Adapters

Wenjuan Han ,Yingnian Wu ,Bo Pang

doi:10.48448/1bqq-8b16

Abstract

Transfer learning with large pretrained transformer-based language models like BERT has become a dominating approach for most NLP tasks. Simply fine-tuning those large language models on downstream tasks or combining it with task-specific pretraining is often not robust. In particular, the performance considerably varies as the random seed changes or the number of pretraining and/or fine-tuning iterations varies, and the fine-tuned model is vulnerable to adversarial attack. We propose a simple yet effective adapter-based approach to mitigate these issues. Specifically, we insert small bottleneck layers (i.e., adapter) within each layer of a pretrained model, then fix the pretrained layers and train the adapter layers on the downstream task data, with (1) task-specific unsupervised pretraining and then (2) task-specific supervised training (e.g., classification, sequence labeling). Our experiments demonstrate that such a training scheme leads to improved stability and adversarial robustness in transfer learning to various downstream tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust Transfer Learning with Pretrained Language Models through Adapters

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Towards an Enhanced Understanding of Bias in Pre-trained Neural Language Models: A Survey with Special Emphasis on Affective Bias
Anoop K ... Lajish V L
-
Anoop K, et. al. Anoop K ... Lajish V L
01 Jan 2021
01 Jan 2021

Understanding latent affective bias in large pre-trained neural language models
Anoop Kadan ... Lajish V.L
Natural Language Processing Journal | VOL. 7
Anoop Kadan, et. al.Anoop Kadan ... Lajish V.L
05 Mar 2024
Natural Language Processing Journal | VOL. 7

A self-supervised language model selection strategy for biomedical question answering
Negar Arabzadeh ... Ebrahim Bagheri
Journal of Biomedical Informatics | VOL. 146
Negar Arabzadeh, et. al.Negar Arabzadeh ... Ebrahim Bagheri
16 Sep 2023
Journal of Biomedical Informatics | VOL. 146

How transfer learning impacts linguistic knowledge in deep NLP models?
Nadir Durrani ... Hassan Sajjad
-
Nadir Durrani, et. al.Nadir Durrani ... Hassan Sajjad
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust Transfer Learning with Pretrained Language Models through Adapters

Abstract

Talk to us

Similar Papers