Generative Adversarial Networks Research Articles

Abstract Facial expression generation technology has achieved notable progress in computer vision and artificial intelligence. However, challenges persist regarding background consistency, expression clarity, and detailed representation. Additionally, the instability of generative adversarial networks (GANs) during training affects both image quality and diversity. While diffusion models have demonstrated potential advantages over GANs, research on controllable expression generation remains limited. To address these challenges, this paper proposes a highly natural facial expression generation method based on denoising diffusion implicit models (DDIM) with embedded vein features. This approach avoids adversarial training by employing gradual diffusion to generate specific expressions, thereby enhancing both the diversity and authenticity of the images. Vein features are introduced and embedded within the generated expression images to protect the intellectual property of algorithm-generated digital resources. Firstly, image and expression text guide words are combined as conditional inputs to improve the authenticity and diversity of the generated images. Secondly, a classification coding network is introduced to guide expression generation, thus enhancing the accuracy and consistency of the produced expressions. Furthermore, this paper proposes a vein feature fusion method based on multi-directional local dynamic feature coding operator (MLDFO) and integrates DDIM with frequency-domain watermarking technology to achieve image intellectual property protection. Experimental results demonstrate the effectiveness of this method across several public datasets, including FFHQ, CelebA, FV-USM, and SDUMLA-HMT. Notably, in the CelebA dataset, the average expression recognition rate increased by 11.41%, with a 100.00% recognition rate for happy expressions. The generated expression images exhibit a high degree of authenticity and consistency, and the video conversion tests reveal a natural and smooth effect. These results confirm that this method not only advances facial expression generation technology but also significantly enhances the steganographic protection of images.

Read full abstract

The findings of the 2023 AAPM Grand Challenge on Deep Generative Modeling for Learning Medical Image Statistics are reported in this SpecialReport. The goal of this challenge was to promote the development of deep generative models for medical imaging and to emphasize the need for their domain-relevant assessments via the analysis of relevant imagestatistics. As part of this Grand Challenge, a common training dataset and an evaluation procedure was developed for benchmarking deep generative models for medical image synthesis. To create the training dataset, an established 3D virtual breast phantom was adapted. The resulting dataset comprised about 108000 images of size 512 512. For the evaluation of submissions to the Challenge, an ensemble of 10000 DGM-generated images from each submission was employed. The evaluation procedure consisted of two stages. In the first stage, a preliminary check for memorization and image quality (via the Fréchet Inception Distance [FID]) was performed. Submissions that passed the first stage were then evaluated for the reproducibility of image statistics corresponding to several feature families including texture, morphology, image moments, fractal statistics, and skeleton statistics. A summary measure in this feature space was employed to rank the submissions. Additional analyses of submissions was performed to assess DGM performance specific to individual feature families, the four classes in the training data, and also to identify various artifacts. Fifty-eight submissions from 12 unique users were received for this Challenge. Out of these 12 submissions, 9 submissions passed the first stage of evaluation and were eligible for ranking. The top-ranked submission employed a conditional latent diffusion model, whereas the joint runners-up employed a generative adversarial network, followed by another network for image superresolution. In general, we observed that the overall ranking of the top 9 submissions according to our evaluation method (i) did not match the FID-based ranking, and (ii) differed with respect to individual feature families. Another important finding from our additional analyses was that different DGMs demonstrated similar kinds ofartifacts. This Grand Challenge highlighted the need for domain-specific evaluation to further DGM design as well as deployment. It also demonstrated that the specification of a DGM may differ depending on its intendeduse.

Read full abstract

Generative Adversarial Networks Research Articles

Related Topics

Articles published on Generative Adversarial Networks

LIC-CGAN: fast lithography latent images calculation method for large-area masks using deep learning

LW-DCGAN: a lightweight deep convolutional generative adversarial network for enhancing occluded face recognition

Enhanced Diabetes Detection and Blood Glucose Prediction Using TinyML-Integrated E-Nose and Breath Analysis: A Novel Approach Combining Synthetic and Real-World Data

Symmetric Connected U-Net with Multi-Head Self Attention (MHSA) and WGAN for Image Inpainting

DMFGAN: a multifeature data augmentation method for grape leaf disease identification.

A highly naturalistic facial expression generation method with embedded vein features based on diffusion model

A Decentralized Digital Watermarking Framework for Secure and Auditable Video Data in Smart Vehicular Networks

Research on Attack Detection for Traffic Signal Systems Based on Game Theory and Generative Adversarial Networks

Enhanced MRI-based brain tumour classification with a novel Pix2pix generative adversarial network augmentation framework.

Automated tree crown labeling with 3D radiative transfer modelling achieves human comparable performances for tree segmentation in semi-arid landscapes

A TransISP Based Image Enhancement Method for Visual Disbalance in Low‐light Images

An advancement in AdaSyn for imbalanced learning: An application to fraud detection in digital transactions

Automated construction site layout design system for prefabricated buildings using transformer based conditional GAN

Report on the AAPM grand challenge on deep generative modeling for learning medical image statistics.

Deep learning assisted cancer disease prediction from gene expression data using WT-GAN

Data-driven stochastic dynamic economic dispatch for combined heat and power systems using particle swarm optimization

TransImg: A Translation Algorithm of Visible-to-Infrared Image Based on Generative Adversarial Network

Retraction Note: Rainfall prediction using generative adversarial networks with convolution neural network

Frequency Domain-Based Super Resolution Using Two-Dimensional Structure Consistency for Ultra-High-Resolution Display

Comparison Between Gans and Diffusion Models in the Generation of Synthetic Images for Enhancing Tree Species Recognition

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Generative Adversarial Networks Research Articles

Related Topics

Articles published on Generative Adversarial Networks

LIC-CGAN: fast lithography latent images calculation method for large-area masks using deep learning

LW-DCGAN: a lightweight deep convolutional generative adversarial network for enhancing occluded face recognition

Enhanced Diabetes Detection and Blood Glucose Prediction Using TinyML-Integrated E-Nose and Breath Analysis: A Novel Approach Combining Synthetic and Real-World Data

Symmetric Connected U-Net with Multi-Head Self Attention (MHSA) and WGAN for Image Inpainting

DMFGAN: a multifeature data augmentation method for grape leaf disease identification.

A highly naturalistic facial expression generation method with embedded vein features based on diffusion model

A Decentralized Digital Watermarking Framework for Secure and Auditable Video Data in Smart Vehicular Networks

Research on Attack Detection for Traffic Signal Systems Based on Game Theory and Generative Adversarial Networks

Enhanced MRI-based brain tumour classification with a novel Pix2pix generative adversarial network augmentation framework.

Automated tree crown labeling with 3D radiative transfer modelling achieves human comparable performances for tree segmentation in semi-arid landscapes

A TransISP Based Image Enhancement Method for Visual Disbalance in Low‐light Images

An advancement in AdaSyn for imbalanced learning: An application to fraud detection in digital transactions

Automated construction site layout design system for prefabricated buildings using transformer based conditional GAN

Report on the AAPM grand challenge on deep generative modeling for learning medical image statistics.

Deep learning assisted cancer disease prediction from gene expression data using WT-GAN

Data-driven stochastic dynamic economic dispatch for combined heat and power systems using particle swarm optimization

TransImg: A Translation Algorithm of Visible-to-Infrared Image Based on Generative Adversarial Network

Retraction Note: Rainfall prediction using generative adversarial networks with convolution neural network

Frequency Domain-Based Super Resolution Using Two-Dimensional Structure Consistency for Ultra-High-Resolution Display

Comparison Between Gans and Diffusion Models in the Generation of Synthetic Images for Enhancing Tree Species Recognition