Adaptive Clustering of Robust Semantic Representations for Adversarial Image Purification on Social Networks

Samuel Henrique Silva,Peyman Najafirad,Adel Aladdini,Arun Das

doi:10.1609/icwsm.v16i1.19350

Abstract

Advances in Artificial Intelligence (AI) have made it possible to automate human-level visual search and perception tasks on the massive sets of image data shared on social media every day. However, AI-based automated filters are highly susceptible to deliberate image attacks that can lead to misclassification of content such as cyberbullying, child sexual abuse material (CSAM), adult content, and deepfakes. One of the most effective defenses against such perturbations is adversarial training, but it comes at the cost of generalization to unseen attacks and transferability across models. In this article, we propose a robust defense against adversarial image attacks that is model agnostic and generalizes to unseen adversaries. We begin with a baseline model, extracting the latent representations for each class and adaptively clustering the latent representations that share a semantic similarity. Next, we obtain the distributions of these clustered latent representations along with their originating images, and from them we learn semantic reconstruction dictionaries (SRD). We then adversarially train a new model, constraining its latent space representation to minimize the distance between the adversarial latent representation and the true cluster distribution. To purify an image, we decompose the input into low- and high-frequency components. The high-frequency component is reconstructed using the best SRD learned from the clean dataset. To select the best SRD, we rely on the distance between the robust latent representations and the semantic cluster distributions. The output is a purified image with no perturbations. Evaluations on comprehensive datasets, including image benchmarks and social media images, demonstrate that our proposed purification approach guards and considerably enhances the accuracy of AI-based image filters on perturbed unlawful and harmful images.
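The purification step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the filter used for the frequency split or the distance used to pick a cluster, so everything below is an assumption made for illustration: a box blur stands in for the low-pass filter, Euclidean distance to cluster means stands in for the distance to the cluster distributions, and the names `box_blur`, `split_frequencies`, and `nearest_cluster` are hypothetical. Dictionary learning and the reconstruction of the high-frequency component are not shown.

```python
import numpy as np


def box_blur(image: np.ndarray, radius: int = 1) -> np.ndarray:
    """Low-pass filter (assumed): mean over a (2r+1) x (2r+1) window."""
    k = 2 * radius + 1
    # Reflect-pad so the output has the same shape as the input.
    padded = np.pad(image, radius, mode="reflect")
    out = np.zeros(image.shape, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + image.shape[0], dx:dx + image.shape[1]]
    return out / (k * k)


def split_frequencies(image: np.ndarray, radius: int = 1):
    """Split a 2-D grayscale image into low- and high-frequency parts.

    By construction, low + high reproduces the input exactly.
    """
    low = box_blur(image, radius)
    high = image - low
    return low, high


def nearest_cluster(latent: np.ndarray, cluster_means: np.ndarray) -> int:
    """Index of the cluster whose mean is closest to the latent vector.

    Euclidean distance is an assumption; the abstract only says that a
    distance to the semantic cluster distributions is used.
    """
    distances = np.linalg.norm(cluster_means - latent, axis=1)
    return int(np.argmin(distances))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = rng.random((8, 8))
    low, high = split_frequencies(image)
    print("reconstruction exact:", np.allclose(low + high, image))

    means = np.array([[0.0, 0.0], [5.0, 5.0]])
    print("selected cluster:", nearest_cluster(np.array([4.0, 4.5]), means))
```

In a pipeline of the kind the abstract describes, the selected cluster index would determine which semantic reconstruction dictionary rebuilds the high-frequency part before it is added back to the low-frequency part.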

Journal: Proceedings of the International AAAI Conference on Web and Social Media
Publication Date: May 31, 2022
Citations: 2

