Abstract
Adversarial samples typically aim to fool machine learning (ML) models and often involve minor, pixel-level perturbations that are imperceptible to human observers. In this work, adversarial samples are intended to fool both humans and ML models, which is important in two-stage decision processes. We perform changes at a higher level of abstraction so that a target sample exhibits properties of a desired sample. Technically, we contribute a regularization scheme for autoencoders that incorporates a classifier loss, enabling smooth interpolation between wildly different samples. The realism and effectiveness of the generated samples are confirmed through a user study and further evaluations. Our experiments cover neural networks of four architectures, assessed on MNIST, FashionMNIST, QuickDraw, and CIFAR-10. The results show that our scheme outperforms existing interpolation techniques: on average, other methods exhibit an 11% higher failure rate when producing a sample that belongs to either of the two interpolated classes. Furthermore, our attacks work in both white-box and black-box settings.
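To make the core idea concrete, the following is a minimal, hypothetical sketch of a classifier-regularized interpolation objective. It is not the paper's actual implementation: the decoder and classifier are toy linear stand-ins, and the particular loss terms (`recon`, `ce`) and the weight `lam` are illustrative assumptions. The sketch only shows the structure of combining a reconstruction-style term with a classifier loss on a latent-space interpolation.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-ins for the trained networks (assumptions, not the paper's models):
W_dec = rng.normal(size=(4, 2))   # "decoder": 2-D latent -> 4-D sample
W_clf = rng.normal(size=(3, 4))   # "classifier": 4-D sample -> 3 class logits

def decode(z):
    # Linear toy decoder mapping a latent code to sample space.
    return W_dec @ z

def classify(x):
    # Softmax over toy classifier logits.
    logits = W_clf @ x
    e = np.exp(logits - logits.max())
    return e / e.sum()

def interp_loss(z_a, z_b, t, target_class, lam=1.0):
    """Illustrative classifier-regularized interpolation loss.

    Linearly interpolates between two latent codes, decodes the
    interpolant, and combines a reconstruction-style term with a
    cross-entropy term pushing the decoded sample toward a target class.
    """
    z_t = (1.0 - t) * z_a + t * z_b            # latent interpolation
    x_t = decode(z_t)                          # decoded interpolant
    probs = classify(x_t)
    ce = -np.log(probs[target_class] + 1e-9)   # classifier loss term
    recon = np.mean((x_t - decode(z_b)) ** 2)  # pull toward target sample
    return recon + lam * ce

z_a, z_b = rng.normal(size=2), rng.normal(size=2)
loss = interp_loss(z_a, z_b, t=0.5, target_class=1)
```

Sweeping `t` from 0 to 1 under such an objective would trace a path between the two samples, with the classifier term shaping how class identity shifts along the way.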