Exploiting Cultural Biases via Homoglyphs inText-to-Image Synthesis (Abstract Reprint)

Lukas Struppek,Kristian Kersting,Felix Friedrich,Patrick Schramowski,Manuel Brack,Dominik Hintersdorf

doi:10.24963/ijcai.2024/958

Abstract

Models for text-to-image synthesis, such as DALL-E 2 and Stable Diffusion, have recently drawn a lot of interest from academia and the general public. These models are capable of producing high-quality images that depict a variety of concepts and styles when conditioned on textual descriptions. However, these models adopt cultural characteristics associated with specific Unicode scripts from their vast amount of training data, which may not be immediately apparent. We show that by simply inserting single non-Latin characters in the textual description, common models reflect cultural biases in their generated images. We analyze this behavior both qualitatively and quantitatively and identify a model’s text encoder as the root cause of the phenomenon. Such behavior can be interpreted as a model feature, offering users a simple way to customize the image generation and reflect their own cultural background. Yet, malicious users or service providers may also try to intentionally bias the image generation. One goal might be to create racist stereotypes by replacing Latin characters with similarly-looking characters from non-Latin scripts, so-called homoglyphs. To mitigate such unnoticed script attacks, we propose a novel homoglyph unlearning method to fine-tune a text encoder, making it robust against homoglyph manipulations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploiting Cultural Biases via Homoglyphs inText-to-Image Synthesis (Abstract Reprint)

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Exploiting Cultural Biases via Homoglyphs in Text-to-Image Synthesis
Lukas Struppek ... Patrick Schramowski
Journal of Artificial Intelligence Research | VOL. 78
Lukas Struppek, et. al.Lukas Struppek ... Patrick Schramowski
18 Dec 2023
Journal of Artificial Intelligence Research | VOL. 78

Implementation of Culturally Responsive Teaching to Increase School Literacy Academic Literacy through Description Text Learning
Dhea Widya Purnamasari ... Sri Wahyuni
EDUTEC : Journal of Education And Technology | VOL. 7
Dhea Widya Purnamasari, et. al.Dhea Widya Purnamasari ... Sri Wahyuni
30 Jun 2024
EDUTEC : Journal of Education And Technology | VOL. 7

Development and deployment of a generative model-based framework for text to photorealistic image generation
Sharad Pande ... Ketan Kotecha
Neurocomputing | VOL. 463
Sharad Pande, et. al.Sharad Pande ... Ketan Kotecha
13 Aug 2021
Neurocomputing | VOL. 463

Realistic Image Generation from Text by Using BERT-Based Embedding
Sanghyuck Na ... Juntae Kim
Electronics | VOL. 11
Sanghyuck Na, et. al.Sanghyuck Na ... Juntae Kim
02 Mar 2022
Electronics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploiting Cultural Biases via Homoglyphs inText-to-Image Synthesis (Abstract Reprint)

Abstract

Talk to us

Similar Papers