Abstract

Image generation from a single natural image using generative adversarial networks (GANs) has recently attracted extensive attention, owing to GANs' ability to produce photo-realistic images and their potential applications in computer vision. However, learning a powerful generative model that produces realistic, high-quality images from only a single natural image remains challenging: training GANs in such limited-data regimes often leads to overfitting, memorization, training divergence, poor image quality, and long training times. In this study, we investigated state-of-the-art GAN models for computer vision tasks and conducted several experiments to better understand the challenges of learning a powerful generative model. We then introduced a novel unconditional GAN model that produces realistic, high-quality, diverse images from a single training image. Our model combines a self-attention mechanism (SAM), a densely connected convolutional network (DenseNet) architecture, and a relativistic average least-squares GAN with gradient penalty (RaLSGAN-GP) in both the generator and discriminator. SAM supplies global contextual information: it complements convolutions on large feature maps and gives both networks greater capacity to capture long-range dependencies, which mitigates long training times and low image quality. DenseNet connects each layer to every other layer in a feed-forward fashion, ensuring maximum information flow through the network; it is highly parameter-efficient, requires less computation to achieve high performance, improves information and gradient flow for easier training, and has a regularizing effect that reduces overfitting. RaLSGAN-GP further improves sample quality and training stability at no additional computational cost. Through this combination, our model generates realistic, high-quality, diverse images while preserving the global context of the training image. We evaluated the model with quantitative metrics, user studies, and qualitative experiments, comparing it against well-known prior models on three datasets (Places, LSUN, ImageNet), and demonstrated its capability in image synthesis and image manipulation tasks. Our experiments show that the model uses parameters more efficiently, prevents overfitting, better captures the internal patch statistics of images with complex structures and textures, achieves comparable performance in single-image generation, and produces visually better results than competing models. User studies confirmed that images generated by our model were frequently confused with the original images. Our model can serve as a powerful tool for various image manipulation tasks, as well as for data augmentation in domains with limited training data.
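To make the self-attention component concrete, the following is a minimal PyTorch sketch of the kind of attention block the abstract describes, following the widely used SAGAN formulation. The module interface, the channel-reduction factor of 8, and the zero-initialized residual weight `gamma` are illustrative assumptions, not details confirmed by the paper.

```python
import torch
import torch.nn as nn

class SelfAttention(nn.Module):
    """SAGAN-style self-attention: lets every spatial position attend to
    every other position, capturing long-range dependencies that plain
    convolutions with small kernels cannot."""

    def __init__(self, channels: int):
        super().__init__()
        self.query = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // 8, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # residual weight, learned from 0

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.size()
        n = h * w
        q = self.query(x).view(b, -1, n).permute(0, 2, 1)  # B x N x C/8
        k = self.key(x).view(b, -1, n)                     # B x C/8 x N
        attn = torch.softmax(torch.bmm(q, k), dim=-1)      # B x N x N attention map
        v = self.value(x).view(b, -1, n)                   # B x C x N
        out = torch.bmm(v, attn.permute(0, 2, 1)).view(b, c, h, w)
        return self.gamma * out + x  # attention output added as a residual
```

Because `gamma` starts at zero, the block initially behaves like an identity mapping and the network learns how much non-local context to mix in, which is one reason such blocks are cheap to add to both generator and discriminator.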
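The dense connectivity pattern can be sketched in the same way: each layer receives the concatenated feature maps of all preceding layers, so information and gradients flow directly between layers. The BN-ReLU-Conv composition and the `growth_rate` parameter below follow the original DenseNet paper; the exact configuration used in this model may differ.

```python
import torch
import torch.nn as nn

class DenseBlock(nn.Module):
    """Generic DenseNet-style block: layer i takes as input the concatenation
    of the block input and the outputs of layers 0..i-1, and emits
    `growth_rate` new feature channels."""

    def __init__(self, in_channels: int, growth_rate: int, num_layers: int):
        super().__init__()
        self.layers = nn.ModuleList([
            nn.Sequential(
                nn.BatchNorm2d(in_channels + i * growth_rate),
                nn.ReLU(inplace=True),
                nn.Conv2d(in_channels + i * growth_rate, growth_rate,
                          kernel_size=3, padding=1),
            )
            for i in range(num_layers)
        ])

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        features = [x]
        for layer in self.layers:
            # Each layer sees every earlier layer's output, concatenated.
            features.append(layer(torch.cat(features, dim=1)))
        return torch.cat(features, dim=1)
```

Since each layer adds only `growth_rate` channels and reuses all earlier features, the block achieves high capacity with comparatively few parameters, which is the parameter efficiency and implicit regularization the abstract refers to.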

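Finally, a hedged sketch of the RaLSGAN-GP objective: the relativistic average least-squares losses of Jolicoeur-Martineau (2018), plus a WGAN-GP-style gradient penalty on samples interpolated between real and generated images. The penalty weight `lambda_gp = 10` and the function names are assumptions for illustration, not the paper's exact code.

```python
import torch
import torch.autograd as autograd

def ralsgan_d_loss(d_real: torch.Tensor, d_fake: torch.Tensor) -> torch.Tensor:
    """Discriminator loss: real scores should exceed the average fake score
    by 1, and vice versa. `d_fake` should come from detached generator output."""
    return (torch.mean((d_real - d_fake.mean() - 1.0) ** 2)
            + torch.mean((d_fake - d_real.mean() + 1.0) ** 2))

def ralsgan_g_loss(d_real: torch.Tensor, d_fake: torch.Tensor) -> torch.Tensor:
    """Generator loss: symmetric to the discriminator loss with roles swapped."""
    return (torch.mean((d_fake - d_real.mean() - 1.0) ** 2)
            + torch.mean((d_real - d_fake.mean() + 1.0) ** 2))

def gradient_penalty(D, real: torch.Tensor, fake: torch.Tensor,
                     lambda_gp: float = 10.0) -> torch.Tensor:
    """Penalize deviations of D's gradient norm from 1 at points sampled on
    straight lines between real and generated images (WGAN-GP style)."""
    alpha = torch.rand(real.size(0), 1, 1, 1, device=real.device)
    x_hat = (alpha * real + (1.0 - alpha) * fake.detach()).requires_grad_(True)
    d_hat = D(x_hat)
    grads = autograd.grad(outputs=d_hat, inputs=x_hat,
                          grad_outputs=torch.ones_like(d_hat),
                          create_graph=True, retain_graph=True)[0]
    grads = grads.view(grads.size(0), -1)
    return lambda_gp * ((grads.norm(2, dim=1) - 1.0) ** 2).mean()
```

Unlike the standard least-squares GAN loss, the relativistic average form compares each sample's score to the mean score of the opposite batch, which tends to stabilize training without extra forward passes; the gradient penalty adds smoothness at the modest cost of one extra backward pass per discriminator step.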