PFB-Diff: Progressive Feature Blending diffusion for text-driven image editing

Wenjing Huang, Shikui Tu, Lei Xu

doi:10.1016/j.neunet.2024.106777

Abstract

Diffusion models have demonstrated their ability to generate diverse and high-quality images, sparking considerable interest in their potential for real image editing applications. However, existing diffusion-based approaches for local image editing often suffer from undesired artifacts due to the latent-level blending of the noised target images and diffusion latent variables, which lack the necessary semantics for maintaining image consistency. To address these issues, we propose PFB-Diff, a Progressive Feature Blending method for Diffusion-based image editing. Unlike previous methods, PFB-Diff seamlessly integrates text-guided generated content into the target image through multi-level feature blending. The rich semantics encoded in deep features and the progressive blending scheme from high to low levels ensure semantic coherence and high quality in edited images. Additionally, we introduce an attention masking mechanism in the cross-attention layers to confine the impact of specific words to desired regions, further improving the performance of background editing and multi-object replacement. PFB-Diff can effectively address various editing tasks, including object/background replacement and object attribute editing. Our method demonstrates its superior performance in terms of editing accuracy and image quality without the need for fine-tuning or training. Our implementation is available at https://github.com/CMACH508/PFB-Diff.
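The two mechanisms named in the abstract, mask-guided blending of features at several levels and attention masking that confines a word to a region, can be illustrated with a small NumPy sketch. This is not the authors' implementation (that lives in the repository linked above). The function names `resize_mask`, `blend_features`, and `masked_cross_attention`, the nearest-neighbor mask resizing, and the toy tensor shapes are all assumptions chosen for illustration.

```python
import numpy as np

def resize_mask(mask, size):
    """Nearest-neighbor resize of a 2D binary mask to (size, size).
    Hypothetical helper, not taken from the paper."""
    h, w = mask.shape
    rows = np.arange(size) * h // size
    cols = np.arange(size) * w // size
    return mask[rows][:, cols]

def blend_features(edited_feats, source_feats, mask):
    """Blend feature maps level by level (illustrative assumption).
    Inside the mask the edited features are kept; outside it the
    source-image features are kept. Each map has shape (C, H, W)."""
    blended = []
    for f_edit, f_src in zip(edited_feats, source_feats):
        # Resize the mask to this level and broadcast it over channels.
        m = resize_mask(mask, f_edit.shape[-1])[None, :, :]
        blended.append(m * f_edit + (1 - m) * f_src)
    return blended

def masked_cross_attention(q, k, v, token_idx, region):
    """Cross-attention in which text token `token_idx` can only
    influence pixels where `region` is 1 (illustrative assumption).
    q: (N, d) pixel queries; k, v: (T, d) text keys/values with T >= 2;
    region: (N,) binary array."""
    scores = q @ k.T / np.sqrt(q.shape[-1])       # (N, T)
    scores[region == 0, token_idx] = -np.inf      # block token outside region
    scores -= scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights

# Toy usage: the right half of the image is the edit region.
mask = np.zeros((8, 8))
mask[:, 4:] = 1
edited = [np.ones((3, 8, 8)), np.ones((3, 4, 4))]
source = [np.zeros((3, 8, 8)), np.zeros((3, 4, 4))]
blended = blend_features(edited, source, mask)

rng = np.random.default_rng(0)
q = rng.normal(size=(4, 8))
k = rng.normal(size=(3, 8))
v = rng.normal(size=(3, 8))
region = np.array([1, 1, 0, 0])
attended, weights = masked_cross_attention(q, k, v, token_idx=1, region=region)
```

In this sketch the same mask is reused at every feature resolution, so the edited content and the preserved background are combined consistently across levels, while the attention mask gives the chosen token zero weight at every pixel outside its region.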

More From: Neural Networks

Similar Papers

TexFit: Text-Driven Fashion Image Editing with Diffusion Models
Tongxin Wang ... Mang Ye
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Siyu Zou ... Rongsheng Zhang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024

CAISE: Conversational Agent for Image Search and Editing
Hyounghun Kim ... Franck Dernoncourt
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
28 Jun 2022

Optimal transport-based unsupervised semantic disentanglement: A novel approach for efficient image editing in GANs
Yunqi Liu ... Xiaohui Cui
Displays | VOL. 80
21 Oct 2023
