Foreground and background separated image style transfer with a single text condition

Yue Yu, Jianming Wang, Nengli Li

doi:10.1016/j.imavis.2024.104956

Abstract

Traditional image-based style transfer requires additional reference style images, making it less user-friendly. Text-based methods are more convenient but suffer from slow generation, unclear content, and poor quality. In this work, we propose a new style transfer method, SA2-CS (Semantic-Aware and Salient Attention CLIPStyler), which is based on the Contrastive Language-Image Pre-training (CLIP) model and a salient object detection network. Masks obtained from the salient object detection network guide the style transfer process, and different optimization strategies are applied according to the different masks. Extensive experiments with diverse content images and style text descriptions demonstrate our method's advantages: the network is easy to train and converges rapidly, and it achieves stable, superior generation results compared to other methods. Our approach addresses over-stylization of the foreground, enhances foreground-background contrast, and enables precise control over style transfer in different semantic regions.
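To illustrate the general idea of treating foreground and background differently under a saliency mask, here is a minimal sketch. It is not the SA2-CS method, which optimizes a network under CLIP guidance. It only shows a simple per-pixel blend in which salient pixels receive a weaker style strength than the background. The function name `blend_with_mask`, the parameters `fg_strength` and `bg_strength`, and the list-of-lists grayscale image representation are all assumptions made for this example.

```python
from typing import List

# Hypothetical representation: a grayscale image as rows of floats in [0, 1].
Image = List[List[float]]


def blend_with_mask(content: Image, stylized: Image, mask: Image,
                    fg_strength: float = 0.3,
                    bg_strength: float = 1.0) -> Image:
    """Blend content and stylized images per pixel.

    mask is 1.0 on salient (foreground) pixels and 0.0 on background pixels;
    intermediate values interpolate between the two style strengths.
    """
    out: Image = []
    for c_row, s_row, m_row in zip(content, stylized, mask):
        row = []
        for c, s, m in zip(c_row, s_row, m_row):
            # Style strength for this pixel, chosen by the saliency mask.
            strength = m * fg_strength + (1.0 - m) * bg_strength
            # Linear interpolation between content and stylized values.
            row.append((1.0 - strength) * c + strength * s)
        out.append(row)
    return out


# Usage: one foreground pixel followed by one background pixel.
result = blend_with_mask([[0.0, 0.0]], [[1.0, 1.0]], [[1.0, 0.0]])
```

With the default strengths, the foreground pixel keeps most of its original content while the background pixel is fully stylized, which is one simple way to limit over-stylization of the foreground.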

Journal: Image and Vision Computing
Publication Date: Feb 21, 2024
Citations: 1

Similar Papers

Bimodal Information Fusion Network for Salient Object Detection based on Transformer
Zhuo Wang ... Minyue Xiao
22 Jul 2022

Attention guided contextual feature fusion network for salient object detection
Jin Zhang ... Yugen Yi
Image and Vision Computing | VOL. 117
13 Nov 2021

EMNet: Edge-guided multi-level network for salient object detection in low-light images
Lianghu Jing ... Bo Wang
Image and Vision Computing | VOL. 143
18 Feb 2024

A parallel down-up fusion network for salient object detection in optical remote sensing images
Chongyi Li ... Yao Zhao
Neurocomputing | VOL. 415
14 Sep 2020
