IBD: An Interpretable Backdoor-Detection Method via Multivariate Interactions.

Yixiao Xu,Bangzhou Xin,Kangyi Ding,Xiaolei Liu

doi:10.3390/s22228697

Yixiao Xu, Bangzhou Xin + Show 2 more

Open Access

https://doi.org/10.3390/s22228697

Copy DOI

Abstract

Recent work has shown that deep neural networks are vulnerable to backdoor attacks. In comparison with the success of backdoor-attack methods, existing backdoor-defense methods face a lack of theoretical foundations and interpretable solutions. Most defense methods are based on experience with the characteristics of previous attacks, but fail to defend against new attacks. In this paper, we propose IBD, an interpretable backdoor-detection method via multivariate interactions. Using information theory techniques, IBD reveals how the backdoor works from the perspective of multivariate interactions of features. Based on the interpretable theorem, IBD enables defenders to detect backdoor models and poisoned examples without introducing additional information about the specific attack method. Experiments on widely used datasets and models show that IBD achieves a 78% increase in average in detection accuracy and an order-of-magnitude reduction in time cost compared with existing backdoor-detection methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

IBD: An Interpretable Backdoor-Detection Method via Multivariate Interactions.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)

Lead the way for us

Journal: Sensors (Basel, Switzerland)	Publication Date: Nov 10, 2022
License type: CC BY 4.0

Similar Papers

One-to-N & N-to-One: Two Advanced Backdoor Attacks Against Deep Learning Models
Mingfu Xue ... Weiqiang Liu
IEEE Transactions on Dependable and Secure Computing | VOL. 19
Mingfu Xue, et. al.Mingfu Xue ... Weiqiang Liu
02 Oct 2020
IEEE Transactions on Dependable and Secure Computing | VOL. 19

Vulnerabilities of Deep Learning-Driven Semantic Communications to Backdoor (Trojan) Attacks
Yalin E Sagduyu ... Tugba Erpek
-
Yalin E Sagduyu, et. al.Yalin E Sagduyu ... Tugba Erpek
22 Mar 2023
22 Mar 2023

FDNet: Imperceptible backdoor attacks via frequency domain steganography and negative sampling
Liang Dong ... Zhidong Shen
Neurocomputing | VOL. 583
Liang Dong, et. al.Liang Dong ... Zhidong Shen
13 Mar 2024
Neurocomputing | VOL. 583

Backdoor Federated Learning-Based mmWave Beam Selection
Zhengming Zhang ... Xiangyu Zhang
IEEE Transactions on Communications | VOL. 70
Zhengming Zhang, et. al.Zhengming Zhang ... Xiangyu Zhang
01 Oct 2022
IEEE Transactions on Communications | VOL. 70

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

IBD: An Interpretable Backdoor-Detection Method via Multivariate Interactions.

Abstract

Talk to us

Similar Papers

More From: Sensors (Basel, Switzerland)