Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually

Mazal Bethany,Peyman Najafirad,Brandon Wherry,Nishant Vishwamitra

doi:10.1609/aaai.v38i2.27835

Abstract

Social media platforms are being increasingly used by malicious actors to share unsafe content, such as images depicting sexual activity, cyberbullying, and self-harm. Consequently, major platforms use artificial intelligence (AI) and human moderation to obfuscate such images to make them safer. Two critical needs for obfuscating unsafe images is that an accurate rationale for obfuscating image regions must be provided, and the sensitive regions should be obfuscated (e.g. blurring) for users' safety. This process involves addressing two key problems: (1) the reason for obfuscating unsafe images demands the platform to provide an accurate rationale that must be grounded in unsafe image-specific attributes, and (2) the unsafe regions in the image must be minimally obfuscated while still depicting the safe regions. In this work, we address these key issues by first performing visual reasoning by designing a visual reasoning model (VLM) conditioned on pre-trained unsafe image classifiers to provide an accurate rationale grounded in unsafe image attributes, and then proposing a counterfactual explanation algorithm that minimally identifies and obfuscates unsafe regions for safe viewing, by first utilizing an unsafe image classifier attribution matrix to guide segmentation for a more optimal subregion segmentation followed by an informed greedy search to determine the minimum number of subregions required to modify the classifier's output based on attribution score. Extensive experiments on uncurated data from social networks emphasize the efficacy of our proposed method. We make our code available at: https://github.com/SecureAIAutonomyLab/ConditionalVLM

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Social media platforms generate billions of dollars in revenue from U.S. youth: Findings from a simulated revenue model.
Amanda Raffoul ... S Bryn Austin
PLOS ONE | VOL. 18
Amanda Raffoul, et. al.Amanda Raffoul ... S Bryn Austin
27 Dec 2023
PLOS ONE | VOL. 18

The Advertising Policies of Major Social Media Platforms Overlook the Imperative to Restrict the Exposure of Children and Adolescents to the Promotion of Unhealthy Foods and Beverages.
Gary Sacks ... Evelyn Suk Yi Looi
International Journal of Environmental Research and Public Health | VOL. 17
Gary Sacks, et. al.Gary Sacks ... Evelyn Suk Yi Looi
01 Jun 2020
International Journal of Environmental Research and Public Health | VOL. 17

Indian Dermatologists Wield Technology to Combat COVID-19!
Aseem Sharma ... Deepak Jakhar
Indian dermatology online journal | VOL. 11
Aseem Sharma, et. al.Aseem Sharma ... Deepak Jakhar
01 Jan 2020
Indian dermatology online journal | VOL. 11

Towards adopting AI techniques for monitoring social media activities
Lina Muhammad Al-Ghamdi
Sustainable Engineering and Innovation | VOL. 3
Lina Muhammad Al-GhamdiLina Muhammad Al-Ghamdi
20 Jan 2021
Sustainable Engineering and Innovation | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Image Safeguarding: Reasoning with Conditional Vision Language Model and Obfuscating Unsafe Content Counterfactually

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence