Abstract

In safety-critical machine learning applications, it is crucial to defend models against adversarial attacks, i.e., small modifications of the input that change the model's predictions. Besides the rigorously studied $\ell_p$-bounded additive perturbations, semantic perturbations (e.g., rotation, translation) raise serious concerns about deploying ML systems in the real world. It is therefore important to provide provable guarantees for deep learning models against semantically meaningful input transformations. In this paper, we propose a new universal probabilistic certification approach based on Chernoff-Cramér bounds that can be used in general attack settings. We estimate the probability that a model fails when the attack is sampled from a given distribution. Our theoretical findings are supported by experimental results on several datasets.
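As a rough illustration of the certification idea, a Chernoff-Cramér bound controls the tail probability of a random quantity through its cumulant (log moment) generating function; the notation below is a sketch and is not taken from the paper:

```latex
% Generic Chernoff-Cramer tail bound (illustrative notation, not the paper's).
% Let X be a real-valued random variable, e.g. a loss of the model when the
% attack parameter is sampled from a fixed distribution, and let t be a threshold.
\[
  \Pr[X \ge t]
  \;\le\;
  \inf_{\lambda > 0} \exp\!\bigl( -\lambda t + \log \mathbb{E}\bigl[e^{\lambda X}\bigr] \bigr),
\]
% so an estimate of the moment generating function E[e^{lambda X}], obtained from
% samples of the attack distribution, yields an upper bound on the probability
% that the model fails (i.e., that X exceeds the threshold t).
```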
