Abstract

While there has been substantial research using adversarial attacks to analyze NLP models, each attack is implemented in its own code repository. It remains challenging to develop NLP attacks and utilize them to improve model performance. This paper introduces TextAttack, a Python framework for adversarial attacks, data augmentation, and adversarial training in NLP. TextAttack builds attacks from four components: a goal function, a set of constraints, a transformation, and a search method. TextAttack’s modular design enables researchers to easily construct attacks from combinations of novel and existing components. TextAttack provides implementations of 16 adversarial attacks from the literature and supports a variety of models and datasets, including BERT and other transformers, and all GLUE tasks. TextAttack also includes data augmentation and adversarial training modules for using components of adversarial attacks to improve model accuracy and robustness. TextAttack is democratizing NLP: anyone can try data augmentation and adversarial training on any model or dataset, with just a few lines of code. Code and tutorials are available at https://github.com/QData/TextAttack.
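
As a rough illustration of the "few lines of code" claim for data augmentation, the sketch below uses TextAttack's augmentation module. The augmenter class and parameter names follow the project's documentation, but exact names and defaults may differ across library versions, so treat this as a sketch rather than a canonical API.

```python
# Minimal sketch of TextAttack's data augmentation API.
# Class and parameter names follow the project's documentation but may
# differ across library versions.
from textattack.augmentation import EmbeddingAugmenter

# Replace a fraction of words with nearest neighbors in a word-embedding space,
# generating several augmented copies of the input sentence.
augmenter = EmbeddingAugmenter(pct_words_to_swap=0.1, transformations_per_example=4)
print(augmenter.augment("TextAttack makes data augmentation straightforward."))
```

Other augmentation recipes (for example, a WordNet-based augmenter) live in the same module and can be swapped in without changing the surrounding code.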

Highlights

  • To encourage the development of the adversarial robustness field, we introduce TextAttack, a Python framework for adversarial attacks, data augmentation, and adversarial training in NLP

  • We present TextAttack, an open-source framework for testing the robustness of NLP models (a usage sketch follows this list)

  • TextAttack defines an attack in four modules: a goal function, a set of constraints, a transformation, and a search method
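
To make these highlights concrete, the sketch below runs one of the attack recipes implemented in TextAttack (TextFooler) against a pretrained Hugging Face classifier. The Attacker, AttackArgs, and HuggingFaceDataset class names, the TextFoolerJin2019.build helper, and the textattack/bert-base-uncased-imdb checkpoint follow the project's documentation but are version-dependent assumptions.

```python
# Sketch: attacking a pretrained classifier with a built-in attack recipe.
# Class names and the "textattack/bert-base-uncased-imdb" checkpoint are
# assumptions based on the TextAttack documentation.
import transformers

from textattack import Attacker, AttackArgs
from textattack.attack_recipes import TextFoolerJin2019
from textattack.datasets import HuggingFaceDataset
from textattack.models.wrappers import HuggingFaceModelWrapper

# Load a fine-tuned sequence classifier and wrap it for TextAttack.
model = transformers.AutoModelForSequenceClassification.from_pretrained(
    "textattack/bert-base-uncased-imdb"
)
tokenizer = transformers.AutoTokenizer.from_pretrained("textattack/bert-base-uncased-imdb")
model_wrapper = HuggingFaceModelWrapper(model, tokenizer)

# Build the TextFooler attack (Jin et al., 2019) from its recipe and run it
# on a handful of IMDB test examples.
attack = TextFoolerJin2019.build(model_wrapper)
dataset = HuggingFaceDataset("imdb", split="test")
Attacker(attack, dataset, AttackArgs(num_examples=10)).attack_dataset()
```

The project README also describes an equivalent command-line entry point (e.g., `textattack attack --recipe textfooler --model bert-base-uncased-mr --num-examples 100`) for users who prefer not to write Python.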



Introduction

Over the last few years, there has been growing interest in investigating the adversarial robustness of NLP models, including new methods for generating adversarial examples and better approaches to defending against these adversaries (Alzantot et al., 2018; Jin et al., 2019; Kuleshov et al., 2018; Li et al., 2019; Gao et al., 2018; Wang et al., 2019; Ebrahimi et al., 2017; Zang et al., 2020; Pruthi et al., 2019). Implementing previous work as a baseline is often time-consuming and error-prone due to a lack of source code, and precisely replicating results is complicated by small details left out of the publication. These barriers make benchmark comparisons hard to trust and severely hinder the development of this field.

To unify adversarial attack methods into one system, we decompose NLP attacks into four components: a goal function, a set of constraints, a transformation, and a search method. The attack attempts to perturb an input text such that the model output fulfills the goal function (i.e., the goal function indicates whether the attack is successful) and the perturbation adheres to the set of constraints (e.g., a grammar constraint or a semantic similarity constraint). A search method is used to find a sequence of transformations that produces a successful adversarial example.
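
As a rough sketch of how the four components might be assembled with TextAttack's API, the example below builds a simple word-substitution attack: an untargeted classification goal function, a few constraints, an embedding-based word swap transformation, and a greedy search ordered by word importance. Module paths and class names follow the TextAttack documentation, but exact locations (e.g., textattack.Attack vs. textattack.shared.Attack) vary between releases, and the pretrained checkpoint is an assumption.

```python
# Sketch: composing an attack from the four components described above.
# Module paths and class names follow the TextAttack docs but may vary
# between releases; the pretrained checkpoint is an assumption.
import transformers

from textattack import Attack
from textattack.constraints.pre_transformation import RepeatModification, StopwordModification
from textattack.constraints.semantics import WordEmbeddingDistance
from textattack.goal_functions import UntargetedClassification
from textattack.models.wrappers import HuggingFaceModelWrapper
from textattack.search_methods import GreedyWordSwapWIR
from textattack.transformations import WordSwapEmbedding

# Wrap the victim model so TextAttack can query it.
model = transformers.AutoModelForSequenceClassification.from_pretrained(
    "textattack/bert-base-uncased-imdb"
)
tokenizer = transformers.AutoTokenizer.from_pretrained("textattack/bert-base-uncased-imdb")
model_wrapper = HuggingFaceModelWrapper(model, tokenizer)

# 1) Goal function: the attack succeeds when the predicted label changes.
goal_function = UntargetedClassification(model_wrapper)
# 2) Constraints: restrict which perturbations count as valid.
constraints = [
    RepeatModification(),                    # don't modify the same word twice
    StopwordModification(),                  # leave stopwords alone
    WordEmbeddingDistance(min_cos_sim=0.8),  # swapped words must stay close in embedding space
]
# 3) Transformation: swap words with their nearest neighbors in embedding space.
transformation = WordSwapEmbedding(max_candidates=50)
# 4) Search method: greedily swap words, ordered by word importance ranking.
search_method = GreedyWordSwapWIR()

attack = Attack(goal_function, constraints, transformation, search_method)
```

Swapping out any one of the four pieces, for example a different search method or an additional constraint, yields a new attack without touching the rest of the pipeline.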
