Abstract

Despite the remarkable empirical performance of deep learning models, many studies have revealed their vulnerability to adversarial examples: such models produce erroneous predictions on inputs carrying imperceptible adversarial perturbations. Although recent work based on adversarial training has substantially improved model robustness, an evident gap between natural accuracy and adversarial robustness remains. To mitigate this problem, in this paper we assume that robust and non-robust representations are two basic ingredients entangled in the integral representation. To achieve adversarial robustness, the robust representations of natural and adversarial examples should be disentangled from the non-robust part, and aligning these robust representations can bridge the gap between accuracy and robustness. Motivated by this, we propose a novel defense method called the Deep Robust Representation Disentanglement Network (DRRDN). Specifically, DRRDN employs a disentangler to extract and align the robust representations of both adversarial and natural examples. Theoretical analysis guarantees that the trade-off between robustness and accuracy is mitigated when disentanglement and alignment are achieved. Experimental results on benchmark datasets demonstrate the empirical superiority of our method.
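To make the high-level idea concrete, the following is a minimal sketch of a disentangle-and-align training objective in PyTorch. The abstract does not specify the architecture or loss functions, so everything here is an assumption for illustration: the `Disentangler` with two projection heads, the MSE alignment term, and the `align_weight` coefficient are hypothetical stand-ins, not the paper's actual DRRDN formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Disentangler(nn.Module):
    """Hypothetical sketch: split encoder features into robust and
    non-robust components via two projection heads."""
    def __init__(self, feat_dim: int, rep_dim: int):
        super().__init__()
        self.robust_head = nn.Linear(feat_dim, rep_dim)     # robust branch
        self.nonrobust_head = nn.Linear(feat_dim, rep_dim)  # non-robust branch

    def forward(self, feats: torch.Tensor):
        return self.robust_head(feats), self.nonrobust_head(feats)

def disentangle_align_loss(encoder, disentangler, classifier,
                           x_nat, x_adv, y, align_weight: float = 1.0):
    """Illustrative objective (not the paper's exact losses):
    classify from the robust representation and align the robust
    representations of natural and adversarial inputs."""
    r_nat, _ = disentangler(encoder(x_nat))
    r_adv, _ = disentangler(encoder(x_adv))
    # Cross-entropy on both views, using only the robust component.
    cls_loss = (F.cross_entropy(classifier(r_nat), y)
                + F.cross_entropy(classifier(r_adv), y))
    # Alignment term: pull the two robust representations together.
    align_loss = F.mse_loss(r_adv, r_nat)
    return cls_loss + align_weight * align_loss
```

Under this reading, the alignment term is what bridges the accuracy-robustness gap: if the robust representations of natural and adversarial inputs coincide, the classifier's behavior on the two distributions coincides as well.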
