Abstract
To achieve content-consistent results in text-conditioned image editing, existing methods typically employ a reconstruction branch that captures source-image details via diffusion inversion and a generation branch that synthesizes the target image from the given textual prompt and the masked source-image details. However, accurately segmenting source details is challenging under the current fixed-threshold mask strategy, and inadequacies in the inversion process can lead to insufficient retention of source details. In this paper, we propose SAMControl (Soft Attention Mask) to adaptively control pose and object details for image editing. SAMControl dynamically learns flexible attention masks for different images at different diffusion steps. Furthermore, in the reconstruction branch, we adopt a direct inversion technique to preserve the fidelity of source details within SAM. Extensive qualitative and quantitative results demonstrate the effectiveness of the proposed method.
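To make the contrast concrete, the sketch below compares a fixed-threshold binary mask with a soft, sigmoid-shaped alternative applied to a toy cross-attention map. This is only an illustration of the general idea, not the paper's actual formulation: the function names, the sigmoid parameterization, and the `tau`/`temperature` parameters are all assumptions, standing in for whatever per-image, per-step mask the method actually learns.

```python
import numpy as np

def fixed_threshold_mask(attn, tau=0.5):
    # Baseline strategy: binarize a cross-attention map with one
    # fixed threshold shared by all images and diffusion steps.
    return (attn >= tau).astype(np.float32)

def soft_attention_mask(attn, tau, temperature):
    # Hypothetical soft mask (illustrative only): a sigmoid whose
    # threshold `tau` and sharpness `temperature` could be learned
    # per image and per diffusion step, yielding graded values in
    # (0, 1) instead of an abrupt 0/1 segmentation.
    return 1.0 / (1.0 + np.exp(-(attn - tau) / temperature))

# Toy 2x2 cross-attention map with values straddling the threshold.
attn = np.array([[0.10, 0.45],
                 [0.55, 0.90]])

hard = fixed_threshold_mask(attn)                           # 0/1 mask
soft = soft_attention_mask(attn, tau=0.5, temperature=0.1)  # graded mask
```

Near-threshold entries (0.45 and 0.55) flip discontinuously under the hard mask but blend smoothly under the soft one, which is the kind of behavior a fixed threshold cannot provide.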
ACM Transactions on Multimedia Computing, Communications, and Applications