Abstract

The application of self-supervised methods has resulted in broad improvements to neural network performance by leveraging large, untapped collections of unlabeled data to learn generalized underlying structure. In this work, we harness unsupervised data augmentation (UDA) to mitigate backdoor, or Trojan, attacks on deep neural networks. We show that UDA is more effective at removing the effects of a trigger than current state-of-the-art methods, for both feature-space and point triggers. These results demonstrate that UDA is both an effective and practical approach to mitigating the effects of backdoors on neural networks.
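The abstract does not spell out UDA's training objective, but UDA is commonly described as adding a consistency loss that pushes a model's predictions on an unlabeled input and on an augmented view of that input to agree. A minimal sketch of that consistency term, assuming a KL-divergence formulation over softmax outputs (the function names and shapes here are illustrative, not from the paper):

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def uda_consistency_loss(logits_clean, logits_augmented, eps=1e-12):
    """KL(p_clean || p_augmented), averaged over the batch.

    Sketch of UDA's unsupervised objective: predictions on an
    unlabeled example and on its augmented view should agree.
    """
    p = softmax(logits_clean)
    q = softmax(logits_augmented)
    kl = np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=-1)
    return float(kl.mean())

# Identical views give (near-)zero loss; divergent views give a positive loss.
clean = np.array([[2.0, 0.5, -1.0]])
shifted = np.array([[0.0, 2.0, -1.0]])
print(uda_consistency_loss(clean, clean))
print(uda_consistency_loss(clean, shifted))
```

In full UDA this term is added to the supervised loss on the labeled set; here it stands alone only to show the shape of the consistency objective.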
