Adversarial Self-Attention for Language Understanding

Hongqiu Wu,Min Zhang,Pengjun Xie,Fei Huang,Hai Zhao,Ruixue Ding

doi:10.1609/aaai.v37i11.26608

Abstract

Deep neural models (e.g. Transformer) naturally learn spurious features, which create a ``shortcut'' between the labels and inputs, thus impairing the generalization and robustness. This paper advances self-attention mechanism to its robust variant for Transformer-based pre-trained language models (e.g. BERT). We propose Adversarial Self-Attention mechanism (ASA), which adversarially biases the attentions to effectively suppress the model reliance on features (e.g. specific keywords) and encourage its exploration of broader semantics. We conduct comprehensive evaluation across a wide range of tasks for both pre-training and fine-tuning stages. For pre-training, ASA unfolds remarkable performance gain compared to naive training for longer steps. For fine-tuning, ASA-empowered models outweigh naive models by a large margin considering both generalization and robustness.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Adversarial Self-Attention for Language Understanding

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 1

Similar Papers

Transformer-based deep neural network language models for Alzheimer\u2019s disease risk assessment from targeted speech
Alireza Roshanzamir ... Hamid Aghajan
BMC medical informatics and decision making | VOL. 21
Alireza Roshanzamir, et. al.Alireza Roshanzamir ... Hamid Aghajan
09 Mar 2021
BMC medical informatics and decision making | VOL. 21

Application of Transformer-Based Language Models to Detect Hate Speech in Social Media
Swapnanil Mukherjee ... Sujit Das
Journal of Computational and Cognitive Engineering | VOL. 2
Swapnanil Mukherjee, et. al.Swapnanil Mukherjee ... Sujit Das
17 Dec 2021
Journal of Computational and Cognitive Engineering | VOL. 2

SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks
...
-
, et. al. ...
23 Oct 2021
23 Oct 2021

SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks
...
-
, et. al. ...
21 Oct 2021
21 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Adversarial Self-Attention for Language Understanding

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence