On the robustness of randomized classifiers to adversarial examples

Rafael Pinot,Florian Yger,Cédric Gouy-Pailler,Jamal Atif,Yann Chevaleyre,Laurent Meunier

doi:10.1007/s10994-022-06216-6

Abstract

This paper investigates the theory of robustness against adversarial attacks. We focus on randomized classifiers (i.e. classifiers that output random variables) and provide a thorough analysis of their behavior through the lens of statistical learning theory and information theory. To this aim, we introduce a new notion of robustness for randomized classifiers, enforcing local Lipschitzness using probability metrics. Equipped with this definition, we make two new contributions. The first one consists in devising a new upper bound on the adversarial generalization gap of randomized classifiers. More precisely, we devise bounds on the generalization gap and the adversarial gap i.e. the gap between the risk and the worst-case risk under attack) of randomized classifiers. The second contribution presents a yet simple but efficient noise injection method to design robust randomized classifiers. We show that our results are applicable to a wide range of machine learning models under mild hypotheses. We further corroborate our findings with experimental results using deep neural networks on standard image datasets, namely CIFAR-10 and CIFAR-100. On these tasks, we manage to design robust models that simultaneously achieve state-of-the-art accuracy (over 0.82 clean accuracy on CIFAR-10) and enjoy guaranteed robust accuracy bounds (0.45 against ell _{2} adversaries with magnitude 0.5 on CIFAR-10).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning	Publication Date: Aug 2, 2022
Citations: 2	License type: open-access

R Discovery Prime

R Discovery Prime

On the robustness of randomized classifiers to adversarial examples

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Similar Papers

Generating watermarked adversarial texts
Mingjie Li ... Hanzhou Wu
Journal of Electronic Imaging | VOL. 32
Mingjie Li, et. al.Mingjie Li ... Hanzhou Wu
28 Mar 2023
Journal of Electronic Imaging | VOL. 32

Restoration of Adversarial Examples Using Image Arithmetic Operations
Kazim Ali ... Adnan N Quershi
Intelligent Automation & Soft Computing | VOL. 32
Kazim Ali, et. al.Kazim Ali ... Adnan N Quershi
01 Jan 2021
Intelligent Automation & Soft Computing | VOL. 32

FCDM: A Methodology Based on Sensor Pattern Noise Fingerprinting for Fast Confidence Detection to Adversarial Attacks
Yazhu Lan ... Guohe Zhang
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 39
Yazhu Lan, et. al.Yazhu Lan ... Guohe Zhang
31 Jan 2020
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | VOL. 39

DTFA: Adversarial attack with discrete cosine transform noise and target features on deep neural networks
Dong Yang ... Wei Chen
IET Image Processing | VOL. 17
Dong Yang, et. al.Dong Yang ... Wei Chen
27 Dec 2022
IET Image Processing | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the robustness of randomized classifiers to adversarial examples

Abstract

Talk to us

Similar Papers

More From: Machine Learning