Correcting the distribution of batch normalization signals for Trojan mitigation

Xi Li,Zhen Xiang,David J Miller,George Kesidis

doi:10.1016/j.neucom.2024.128752

Abstract

Backdoor (Trojan) attacks represent a significant adversarial threat to deep neural networks (DNNs). In such attacks, the presence of an attacker’s backdoor trigger causes a test instance to be misclassified into the attacker’s chosen target class. Post-training mitigation methods aim to rectify these misclassifications, ensuring that poisoned models correctly classify backdoor-triggered samples. These methods require the defender to have access to a small, clean dataset and the potentially compromised DNN. However, most defenses rely on parameter fine-tuning, making their effectiveness dependent on the dataset size available to the defender. To overcome the limitations of existing approaches, we propose a method that rectifies misclassifications by correcting the altered distribution of internal layer activations of backdoor-triggered instances. Distribution alterations are corrected by applying simple transformations to internal activations. Notably, our method does not modify any trainable parameters of the DNN, yet it achieves generally good mitigation performance against various backdoor attacks and benchmarks. Consequently, our approach demonstrates robustness even with a limited amount of clean data, making it highly practical for real-world applications. The effectiveness of our approach is validated through both theoretical analysis and extensive experimentation. The appendix is provided as an electronic component and can be accessed via the link in the footnote.22https://arxiv.org/pdf/2308.09850. The source codes can be found in the link33https://github.com/lixi1994/BNA. at the footnote.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Correcting the distribution of batch normalization signals for Trojan mitigation

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Similar Papers

Vulnerabilities of Deep Learning-Driven Semantic Communications to Backdoor (Trojan) Attacks
Yalin E Sagduyu ... Tugba Erpek
-
Yalin E Sagduyu, et. al.Yalin E Sagduyu ... Tugba Erpek
22 Mar 2023
22 Mar 2023

Test-Time Detection of Backdoor Triggers for Poisoned Deep Neural Networks
Xi Li ... David J Miller
-
Xi Li, et. al.Xi Li ... David J Miller
23 May 2022
23 May 2022

Backdoor Attacks on Image Classification Models in Deep Neural Networks
Quanxin Zhang ... Yajie Wang
Chinese Journal of Electronics | VOL. 31
Quanxin Zhang, et. al.Quanxin Zhang ... Yajie Wang
01 Mar 2022
Chinese Journal of Electronics | VOL. 31

PTB: Robust physical backdoor attacks against deep neural networks in real world
Mingfu Xue ... Weiqiang Liu
Computers & Security | VOL. 118
Mingfu Xue, et. al.Mingfu Xue ... Weiqiang Liu
15 Apr 2022
Computers & Security | VOL. 118

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Correcting the distribution of batch normalization signals for Trojan mitigation

Abstract

Talk to us

Similar Papers

More From: Neurocomputing