FICE: Text-conditioned fashion-image editing with guided GAN inversion

Martin Pernuš, Clinton Fookes, Vitomir Štruc, Simon Dobrišek

doi:10.1016/j.patcog.2024.111022

Abstract

Fashion-image editing is a challenging computer-vision task where the goal is to incorporate selected apparel into a given input image. Most existing techniques, known as Virtual Try-On methods, deal with this task by first selecting an example image of the desired apparel and then transferring the clothing onto the target person. In contrast, in this paper we consider editing fashion images with text descriptions. Such an approach has several advantages over example-based virtual try-on techniques: (i) it does not require an image of the target fashion item, and (ii) it allows the expression of a wide variety of visual concepts through the use of natural language. Existing image-editing methods that work with language inputs are heavily constrained by their requirement for training sets with rich attribute annotations, or they are only able to handle simple text descriptions. We address these constraints by proposing a novel text-conditioned editing model called FICE (Fashion Image CLIP Editing) that can handle a wide variety of text descriptions to guide the editing procedure. Specifically, with FICE, we extend the common GAN-inversion process by including semantic, pose-related, and image-level constraints when generating images. We leverage the CLIP model to enforce the text-provided semantics, due to its impressive image–text association capabilities. We furthermore propose a latent-code regularization technique that provides the means to better control the fidelity of the synthesized images. We validate FICE through rigorous experiments on a combination of VITON images and Fashion-Gen text descriptions, comparing it with several state-of-the-art text-conditioned image-editing approaches. Experimental results demonstrate that FICE generates very realistic fashion images and leads to better editing than existing, competing approaches.
The source code is publicly available from: https://github.com/MartinPernus/FICE.
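
The guided-inversion idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the linear `generate` function, the `embed` function, the loss weights `lam_img` and `lam_reg`, and all numeric values are hypothetical stand-ins for a real GAN generator, the CLIP encoders, and tuned hyperparameters. The pose-related constraint is omitted for brevity, and finite-difference gradients replace backpropagation so that the example needs no external libraries. It only shows how a latent code might be optimized against a weighted sum of a semantic term, an image-level term, and a latent-code regularizer.

```python
import math

# Toy stand-ins (assumptions, not the FICE code): a fixed linear "generator"
# mapping a 2-D latent code to 4 "pixels", and an "encoder" that reads the
# first two pixels as an embedding.
GEN = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]

def generate(w):
    return [sum(a * x for a, x in zip(row, w)) for row in GEN]

def embed(image):
    return image[:2]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / (norm + 1e-12)

def total_loss(w, text_emb, source, keep_mask, w_mean, lam_img=1.0, lam_reg=0.1):
    image = generate(w)
    # Semantic term: pull the image embedding toward the text embedding.
    semantic = 1.0 - cosine(embed(image), text_emb)
    # Image-level term: stay close to the source on pixels marked "keep".
    kept = max(1, sum(keep_mask))
    image_term = sum(m * (p - s) ** 2 for m, p, s in zip(keep_mask, image, source)) / kept
    # Latent regularizer: stay close to a reference (mean) latent code.
    latent_reg = sum((a - b) ** 2 for a, b in zip(w, w_mean))
    return semantic + lam_img * image_term + lam_reg * latent_reg

def invert(w_init, targets, steps=200, lr=0.05, eps=1e-5):
    """Gradient descent on the latent code using central finite differences."""
    w = list(w_init)
    for _ in range(steps):
        grad = []
        for i in range(len(w)):
            up, down = w[:], w[:]
            up[i] += eps
            down[i] -= eps
            grad.append((total_loss(up, **targets) - total_loss(down, **targets)) / (2 * eps))
        w = [x - lr * g for x, g in zip(w, grad)]
    return w

w_init = [1.0, 0.2]
targets = {
    "text_emb": [0.0, 1.0],       # hypothetical text embedding
    "source": generate(w_init),   # image being edited
    "keep_mask": [0, 0, 1, 0],    # only pixel 2 must be preserved
    "w_mean": [0.0, 0.0],         # hypothetical mean latent code
}
w_edit = invert(w_init, targets)
print(total_loss(w_init, **targets), total_loss(w_edit, **targets))
```

In this sketch, raising `lam_reg` keeps the edited code closer to the reference latent, while raising `lam_img` preserves more of the source image; in a real system the same trade-off would be tuned against an actual generator and encoder.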

Journal: Pattern Recognition
Publication Date: Sep 14, 2024
Citations: 2

Similar Papers

Stylized Text-to-Fashion Image Generation
Huixian Zhang ... Shuhui Jiang
15 Dec 2021

SPIRIT: Style-guided Patch Interaction for Fashion Image Retrieval with Text Feedback
Yanzhe Chen ... Jiahuan Zhou
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 20
08 Mar 2024

Junior High School Students’ Writing Mastery on English Decriptive and Recount Texts
Febri Rama Suci ... Darmayenti Darmayenti
Turast : Jurnal Penelitian dan Pengabdian | VOL. 7
30 Jul 2019

Fashion clothing matching by global-local feature optimization
Yunzhu Wang ... Qingsong Huang
Journal of Image and Graphics | VOL. 28
01 Jan 2023
