SARGAN: Spatial Attention-Based Residuals for Facial Expression Manipulation

Arbish Akram, Nazar Khan

doi:10.1109/tcsvt.2023.3255243

Abstract

Encoder-decoder architectures are widely used in the generators of generative adversarial networks for facial manipulation. However, we observe that current architectures fail to recover the input image's color and rich facial details such as skin color and texture, and they also introduce artifacts. In this paper, we present a novel method named SARGAN that addresses these limitations from three perspectives. First, we employ spatial attention-based residual blocks instead of vanilla residual blocks to capture the expression-related features to be changed while keeping the other features unchanged. Second, we exploit a symmetric encoder-decoder network to attend to facial features at multiple scales. Third, we propose to train the complete network with a residual connection that feeds the input image directly towards the end of the generator; this relieves the generator of the pressure to regenerate the input face image and thereby lets it produce the desired expression. Both qualitative and quantitative experimental results show that our proposed model performs significantly better than state-of-the-art methods. In addition, existing models require much larger datasets for training, yet their performance degrades on out-of-distribution images. In contrast, SARGAN can be trained on smaller facial expression datasets and generalizes well to out-of-distribution images, including human photographs, portraits, avatars and statues.
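The first and third ideas in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not say how the attention mask is computed or which layers are used. The sketch assumes a mask built from channel-wise mean and max pooling followed by a sigmoid, and the names `spatial_attention`, `attention_residual_block`, `generate`, `transform` and `body` are hypothetical, chosen for illustration only.

```python
import numpy as np


def sigmoid(x):
    """Element-wise logistic function."""
    return 1.0 / (1.0 + np.exp(-x))


def spatial_attention(features):
    """Return a (1, H, W) mask with values in (0, 1) for a (C, H, W) feature map.

    Assumption: the mask is the sigmoid of the channel-wise mean plus the
    channel-wise max. A real model would typically learn this mapping.
    """
    avg = features.mean(axis=0, keepdims=True)
    mx = features.max(axis=0, keepdims=True)
    return sigmoid(avg + mx)


def attention_residual_block(features, transform):
    """Residual block whose residual branch is gated per pixel.

    `transform` stands in for the block's convolutional layers (hypothetical).
    Where the mask is near 0 the input features pass through unchanged.
    """
    residual = transform(features)
    mask = spatial_attention(residual)  # broadcast over channels
    return features + mask * residual


def generate(image, body):
    """Global residual connection: `body` predicts only the change to apply.

    The input image is added back at the end, so the network body does not
    have to reproduce the face itself. Images are assumed to lie in [-1, 1].
    """
    return np.clip(image + body(image), -1.0, 1.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    face = rng.uniform(-1.0, 1.0, size=(3, 8, 8))

    # A stand-in "network" that proposes a small change everywhere.
    small_change = lambda x: 0.1 * np.ones_like(x)

    out = generate(face, lambda x: attention_residual_block(x, small_change) - x)
    print(out.shape)
```

With a residual branch that outputs zeros, both `attention_residual_block` and `generate` return their input unchanged, which is the property the abstract relies on: the generator only has to model what differs between the input and the target expression.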

Journal: IEEE Transactions on Circuits and Systems for Video Technology
Publication Date: Oct 1, 2023
Citations: 6