Attention Meets Perturbations: Robust and Interpretable Attention With Adversarial Training

Shunsuke Kitada,Hitoshi Iyatomi

doi:10.1109/access.2021.3093456

Abstract

Although attention mechanisms have been applied to a variety of deep learning models and have been shown to improve the prediction performance, it has been reported to be vulnerable to perturbations to the mechanism. To overcome the vulnerability to perturbations in the mechanism, we are inspired by adversarial training (AT), which is a powerful regularization technique for enhancing the robustness of the models. In this paper, we propose a general training technique for natural language processing tasks, including AT for attention (Attention AT) and more interpretable AT for attention (Attention iAT). The proposed techniques improved the prediction performance and the model interpretability by exploiting the mechanisms with AT. In particular, Attention iAT boosts those advantages by introducing adversarial perturbation, which enhances the difference in the attention of the sentences. Evaluation experiments with ten open datasets revealed that AT for attention mechanisms, especially Attention iAT, demonstrated (1) the best performance in nine out of ten tasks and (2) more interpretable attention (i.e., the resulting attention correlated more strongly with gradient-based word importance) for all tasks. Additionally, the proposed techniques are (3) much less dependent on perturbation size in AT. Our code is available at https://github.com/shunk031/attention-meets-perturbation

Highlights

A TTENTION mechanisms [1] are widely applied in natural language processing (NLP) field through deep neural networks (DNNs)
COMMON MODEL ARCHITECTURE Our goal is to improve the performance of NLP models by aiming at the robustness of the attention mechanisms
For 20 Newsgroups (20News) and AG News (AGNews) in the binary classification (BC) and bAbI task 1 in question answering (QA), the conventional techniques, including the Vanilla model, were sufficiently accurate, so the performance improvement of the proposed techniques to the tasks was limited to some extent

Summary

Introduction

A TTENTION mechanisms [1] are widely applied in natural language processing (NLP) field through deep neural networks (DNNs). The Transformer [8] and its follow-up models [12], [13] have self-attention mechanisms that estimate the relationship of each word in the sentence. These models take advantage of the effect of the mechanisms and have shown promising performances. There is no doubt that the effect of the mechanisms is extremely large They are not easy to train, as they require huge amounts of GPU memory to maintain the weights of the model. The application of attention mechanisms to DNN models, such as RNN and CNN models, which have been widely used and do not require relatively high training requirements, has not been sufficiently studied

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 15	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Attention Meets Perturbations: Robust and Interpretable Attention With Adversarial Training

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Making attention mechanisms more robust and interpretable with virtual adversarial training
Shunsuke Kitada ... Hitoshi Iyatomi
Applied Intelligence | VOL. 53
Shunsuke Kitada, et. al.Shunsuke Kitada ... Hitoshi Iyatomi
28 Nov 2022
Applied Intelligence | VOL. 53

Targeted adversarial attacks on wind power forecasts
René Heinrich ... Malte Lehna
Machine Learning | VOL. 113
René Heinrich, et. al.René Heinrich ... Malte Lehna
13 Sep 2023
Machine Learning | VOL. 113

Robust Weight Perturbation for Adversarial Training
Chaojian Yu ... Mingming Gong
-
Chaojian Yu, et. al.Chaojian Yu ... Mingming Gong
01 Jul 2022
01 Jul 2022

Robustness May be at Odds with Stability in Adversarial Training based Feature Selection?
Yue Liu ... Yun Li
-
Yue Liu, et. al.Yue Liu ... Yun Li
01 Nov 2022
01 Nov 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Attention Meets Perturbations: Robust and Interpretable Attention With Adversarial Training

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access