Reconstructing Training Data with Informed Adversaries

Borja Balle,Giovanni Cherubin,Jamie Hayes

doi:10.1109/sp46214.2022.9833677

Abstract

Given access to a machine learning model, can an adversary reconstruct the model’s training data? This work studies this question from the lens of a powerful informed adversary who knows all the training data points except one. By instantiating concrete attacks, we show it is feasible to reconstruct the remaining data point in this stringent threat model. For convex models (e.g. logistic regression), reconstruction attacks are simple and can be derived in closed-form. For more general models (e.g. neural networks), we propose an attack strategy based on training a reconstructor network that receives as input the weights of the model under attack and produces as output the target data point. We demonstrate the effectiveness of our attack on image classifiers trained on MNIST and CIFAR-10, and systematically investigate which factors of standard machine learning pipelines affect reconstruction success. Finally, we theoretically investigate what amount of differential privacy suffices to mitigate reconstruction attacks by informed adversaries. Our work provides an effective reconstruction attack that model developers can use to assess memorization of individual points in general settings beyond those considered in previous works (e.g. generative language models or access to training gradients); it shows that standard models have the capacity to store enough information to enable high-fidelity reconstruction of training data points; and it demonstrates that differential privacy can successfully mitigate such attacks in a parameter regime where utility degradation is minimal.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Reconstructing Training Data with Informed Adversaries

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Confidence-ranked reconstruction of census microdata from published statistics
Travis Dick ... Zhiwei Steven Wu
Proceedings of the National Academy of Sciences | VOL. 120
Travis Dick, et. al.Travis Dick ... Zhiwei Steven Wu
17 Feb 2023
Proceedings of the National Academy of Sciences | VOL. 120

Differential privacy for learning vector quantization
Johannes Brinkrolf ... Barbara Hammer
Neurocomputing | VOL. 342
Johannes Brinkrolf, et. al.Johannes Brinkrolf ... Barbara Hammer
04 Feb 2019
Neurocomputing | VOL. 342

Failure of affine‐based reconstruction attack in regenerating vascular feature points
Mahshid Sadeghpour ... Kathy J Horadam
IET Biometrics | VOL. 10
Mahshid Sadeghpour, et. al.Mahshid Sadeghpour ... Kathy J Horadam
28 Jul 2021
IET Biometrics | VOL. 10

PUMA: Performance Unchanged Model Augmentation for Training Data Removal
Ga Wu ... Masoud Hashemi
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Ga Wu, et. al.Ga Wu ... Masoud Hashemi
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Reconstructing Training Data with Informed Adversaries

Abstract

Talk to us

Similar Papers