SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution

Qiqi Bao,Bowen Gang,Qingmin Liao,Yunmeng Liu,Wenming Yang

doi:10.1109/tmm.2023.3238522

Abstract

Numerous CNN-based algorithms have been proposed to reconstruct high-quality face images. However, the inability of convolution operation to model long-distance relationships limits the performance of the CNN-based methods. Moreover, in the high-resolution (HR) image reconstruction stage, with the well decoded feature representations, more efficient architecture design can be explored to synthesize pixel-level image details. In this work, we propose a spatial attention-guided CNN-Transformer aggregation network (SCTANet) for face image super-resolution (FSR) tasks. The core component in the deep feature extraction stage is the Hybrid Attention Aggregation (HAA) block. The HAA block has two parallel paths, one for the Residual Spatial Attention (RSA) block, the other for the Multi-scale Patch embedding and Spatial-attention Masked Transformer (MPSMT) block. The HAA block combines the strengths of CNN and transformer to effectively exploit both local and global information. For the reconstruction stage, we propose to use the Sub-pixel MLP-based Upsampling (SMU) module instead of the conventional CNN architecture. The SMU module promotes the reconstruction of pixel-level image details and reduces computational complexity. Extensive experiments on both synthetic and real-world face datasets demonstrate the superiority of our proposed SCTANet over state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia

Lead the way for us

Journal: IEEE Transactions on Multimedia	Publication Date: Jan 1, 2023
Citations: 13

Similar Papers

DHBSR: A deep hybrid representation-based network for blind image super resolution
Alireza Esmaeilzehi ... M Omair Ahmad
Computer Vision and Image Understanding | VOL. 246
Alireza Esmaeilzehi, et. al.Alireza Esmaeilzehi ... M Omair Ahmad
28 May 2024
Computer Vision and Image Understanding | VOL. 246

Ultralight-Weight Three-Prior Convolutional Neural Network for Single Image Super Resolution
Alireza Esmaeilzehi ... M Omair Ahmad
IEEE Transactions on Artificial Intelligence | VOL. 4
Alireza Esmaeilzehi, et. al.Alireza Esmaeilzehi ... M Omair Ahmad
01 Dec 2023
IEEE Transactions on Artificial Intelligence | VOL. 4

SRNSSI: A Deep Light-Weight Network for Single Image Super Resolution Using Spatial and Spectral Information
Alireza Esmaeilzehi ... M.N.S Swamy
IEEE Transactions on Computational Imaging | VOL. 7
Alireza Esmaeilzehi, et. al.Alireza Esmaeilzehi ... M.N.S Swamy
01 Jan 2020
IEEE Transactions on Computational Imaging | VOL. 7

Dual attention-guided feature pyramid network for instance segmentation of group pigs
Zhiwei Hu ... Tiantian Lou
Computers and Electronics in Agriculture | VOL. 186
Zhiwei Hu, et. al.Zhiwei Hu ... Tiantian Lou
19 May 2021
Computers and Electronics in Agriculture | VOL. 186

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SCTANet: A Spatial Attention-Guided CNN-Transformer Aggregation Network for Deep Face Image Super-Resolution

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Multimedia