Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models

Krishna Sri Ipsit Mantri ,Nevasini Sasikumar

doi:10.48550/arxiv.2306.05182

Abstract

Fashionable image generation aims to synthesize images of diverse fashion prevalent around the globe, helping fashion designers in real-time visualization by giving them a basic customized structure of how a specific design preference would look in real life and what further improvements can be made for enhanced customer satisfaction. Moreover, users can alone interact and generate fashionable images by just giving a few simple prompts. Recently, diffusion models have gained popularity as generative models owing to their flexibility and generation of realistic images from Gaussian noise. Latent diffusion models are a type of generative model that use diffusion processes to model the generation of complex data, such as images, audio, or text. They are called "latent" because they learn a hidden representation, or latent variable, of the data that captures its underlying structure. We propose a method exploiting the equivalence between diffusion models and energy-based models (EBMs) and suggesting ways to compose multiple probability distributions. We describe a pipeline on how our method can be used specifically for new fashionable outfit generation and virtual try-on using LLM-guided text-to-image generation. Our results indicate that using an LLM to refine the prompts to the latent diffusion model assists in generating globally creative and culturally diversified fashion styles and reducing bias.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models

Abstract

Talk to us

Similar Papers

More From: arXiv (Cornell University)

Lead the way for us

Similar Papers

Diffusion Models in Vision: A Survey.
Florinel-Alin Croitoru ... Vlad Hondru
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. PP
Florinel-Alin Croitoru, et. al.Florinel-Alin Croitoru ... Vlad Hondru
01 Sep 2023
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. PP

This Microtubule Does Not Exist: Super-Resolution Microscopy Image Generation by a Diffusion Model.
Alon Saguy ... Yoav Shechtman
Small methods | VOL. -
Alon Saguy, et. al.Alon Saguy ... Yoav Shechtman
14 Oct 2024
Small methods | VOL. -

Z세대 패션디자인전공 학생들의 가상 이미지 수용자 인식 분석
Kyu Jin Lee
Liberal Arts Innovation Center | VOL. 9
Kyu Jin LeeKyu Jin Lee
30 Nov 2022
Liberal Arts Innovation Center | VOL. 9

Data augmentation-based enhanced fingerprint recognition using deep convolutional generative adversarial network and diffusion models
Yukai Liu
Applied and Computational Engineering | VOL. 52
Yukai LiuYukai Liu
27 Mar 2024
Applied and Computational Engineering | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models

Abstract

Talk to us

Similar Papers

More From: arXiv (Cornell University)