Trends and Challenges of Text-to-Image Generation: Sustainability Perspective

Dora Ivezić,Marina Bagić Babac

doi:10.2478/crdj-2023-0004

Dora Ivezić, Marina Bagić Babac

Open Access

https://doi.org/10.2478/crdj-2023-0004

Copy DOI

Journal: Croatian Regional Development Journal	Publication Date: Jun 1, 2023
Citations: 2	License type: CC BY-NC-ND 4.0

Affiliation: University of Zagreb

Abstract

Abstract Text-to-image generation is a rapidly growing field that aims to generate images from textual descriptions. This paper provides a comprehensive overview of the latest trends and developments, highlighting their importance and relevance in various domains, such as art, photography, marketing, and learning. The paper describes and compares various text-to-image models and discusses the challenges and limitations of this field. The findings of this paper demonstrate that recent advancements in deep learning and computer vision have led to significant progress in text-to-image models, enabling them to generate high-quality images from textual descriptions. However, challenges such as ensuring the legality and ethical implications of the final products generated by these models need to be addressed. This paper provides insights into these challenges and suggests future directions for this field. In addition, this study emphasises the need for a sustainability-oriented approach in the text-to-image domain. As text-to-image models advance, it is crucial to conscientiously assess their impact on ecological, cultural, and societal dimensions. Prioritising ethical model use while being mindful of their carbon footprint and potential effects on human creativity becomes crucial for sustainable progress.

Full Text