Generative Adversarial Network Architecture Research Articles

Speech is an effective mode of communication that always conveys abundant and pertinent information, such as the gender, accent, and other distinguishing characteristics of the speaker. These distinctive characteristics allow researchers to identify human voices using artificial intelligence (AI) techniques, which are useful for forensic voice verification, security and surveillance, electronic voice eavesdropping, mobile banking, and mobile purchasing. Deep learning (DL) and other advances in hardware have piqued the interest of researchers studying automatic speaker identification (SI). In recent years, Generative Adversarial Networks (GANs) have demonstrated exceptional ability in producing synthetic data and improving the performance of several machine learning tasks. The capacity of Convolutional Wavelet Packet Transform (CWPT) and Generative Adversarial Networks are combined in this paper to propose a novel way of enhancing the accuracy and robustness of Speaker Recognition and Classification systems. Audio signals are dissected using the Convolutional Wavelet Packet Transform into a multi-resolution, time-frequency representation that faithfully preserves local and global characteristics. The improved audio features better precisely describe speech traits and handle pitch, tone, and pronunciation variations that are frequent in speaker recognition tasks. Using GANs to create synthetic speech samples, our suggested method GAN-CWPT enriches the training data and broadens the dataset's diversity. The generator and discriminator components of the GAN architecture have been tweaked to produce realistic speech samples with attributes quite similar to genuine speaker utterances. The new dataset enhances the Speaker Recognition and Classification system's robustness and generalization, even in environments with little training data. We conduct extensive tests on standard speaker recognition datasets to determine how well our method works. The findings demonstrate that, compared to conventional methods, the GAN-CWPTs combination significantly improves speaker recognition, classification accuracy, and efficiency. Additionally, the suggested model GAN-CWPT exhibits stronger generalization on unknown speakers and excels even with loud and poor audio inputs.

Read full abstract

Microstructures play a central role in determining the mechanical and functional properties of a material. An important aspect in computational materials science is to reliably predict the key microstructural features, and to utilise them in bridging the processing and properties of new materials. The existing microstructure characterization and reconstruction (MCR) techniques have inherent limitations in terms of the lack of design variables and information loss due to various assumptions. In previous studies, Generative Adversarial Network (GAN) models were trained using more than 10,000 image datasets, however without performance analysis using quantitative morphometric and statistical measures. In this perspective, the present work demonstrates the capability of GAN architectures to learn the mapping between random latent vectors and synthetic microstructural images, even in a limited data regime (1225 images). Three different architectures Deep Convolutional GAN (DCGAN) ,Wasserstein GAN- Gradient Penalty (WGAN-GP), StyleGAN2- Adaptive Discriminator Augmentation (ADA) were explored, together with comprehensive statistical and morphological analysis, while training on a publicly accessible Ti-6Al-4V (Ti64) alloy microstructure dataset. The StyleGAN2-ADA outperforms the other two GAN models to generate realistic synthetic microstructures of higher resolution, with good qualitative similarity to original images. The analysis of performance metrics, like Frechet Inception Distance (FID), Kernel Inception Distance (KID), and Inception Score (IS) scores, reveal that the generated image distribution is statistically close to the original distribution of microstructural features. The morphometric parameters, including α/β phase fractions, local α/β boundary thickness, and orientation of lamellar morphology, were also used to compare original and synthetic images quantitatively. More importantly, two-point correlation and t-stochastic neighbour embedding (t-SNE) illustrate the statistical similarity between the original and synthetic microstructures. Taken together, the present work establishes the capability of generative models like GAN in generating representative microstructures of Titanium alloy in a statistically reliable manner. Such an approach, when adopted, will accelerate the field of microstructure fingerprinting.

Read full abstract

Generative Adversarial Network Architecture Research Articles

Related Topics

Articles published on Generative Adversarial Network Architecture

P3 AD: Privacy-Preserved Payload Anomaly Detection for Industrial Internet of Things

A hybrid Cycle GAN-based lightweight road perception pipeline for road dataset generation for Urban mobility.

Methods of applied utilization of generative adversarial networks in graphic data processing

Choosing only the best voice imitators: Top-K many-to-many voice conversion with StarGAN

MIGAN: GAN for facilitating malware image synthesis with improved malware classification on novel dataset

DA-VEGAN: Differentiably Augmenting VAE-GAN for microstructure reconstruction from extremely small data sets

Lensless Image Restoration Based on Multi-Stage Deep Neural Networks and Pix2pix Architecture

RAGAN: A Generative Adversarial Network for risk-aware trajectory prediction in multi-ship encounter situations

Generative Adversarial Network with Convolutional Wavelet Packet Transforms for Automated Speaker Recognition and Classification

Blind Attention Geometric Restraint Neural Network for Single Image Dynamic/Defocus Deblurring.

GENERATIVE ADVERSARIAL NETWORKS FOR IMAGE SYNTHESIS AND STYLE TRANSFER IN VIDEOS

Generative Adversarial Networks (GANs) for Audio-Visual Speech Recognition in Artificial Intelligence IoT

Utilizing Generative Adversarial Networks for Stable Structure Generation in Angry Birds

Multivariate Emulation of Kilometer-Scale Numerical Weather Predictions with Generative Adversarial Networks: A Proof of Concept

Four-channel generative adversarial networks can predict the distribution of reef-associated fish in the South and East China Seas

Adaptive Filter in Single Image SRGAN

Prior-guided generative adversarial network for mammogram synthesis

A deep adversarial approach for the generation of synthetic titanium alloy microstructures with limited training data

Synthetic dual-energy CT reconstruction from single-energy CT Using artificial intelligence.

Deep neural architecture for natural language image synthesis for Tamil text using BASEGAN and hybrid super resolution GAN (HSRGAN)

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Generative Adversarial Network Architecture Research Articles

Related Topics

Articles published on Generative Adversarial Network Architecture

P3 AD: Privacy-Preserved Payload Anomaly Detection for Industrial Internet of Things

A hybrid Cycle GAN-based lightweight road perception pipeline for road dataset generation for Urban mobility.

Methods of applied utilization of generative adversarial networks in graphic data processing

Choosing only the best voice imitators: Top-K many-to-many voice conversion with StarGAN

MIGAN: GAN for facilitating malware image synthesis with improved malware classification on novel dataset

DA-VEGAN: Differentiably Augmenting VAE-GAN for microstructure reconstruction from extremely small data sets

Lensless Image Restoration Based on Multi-Stage Deep Neural Networks and Pix2pix Architecture

RAGAN: A Generative Adversarial Network for risk-aware trajectory prediction in multi-ship encounter situations

Generative Adversarial Network with Convolutional Wavelet Packet Transforms for Automated Speaker Recognition and Classification

Blind Attention Geometric Restraint Neural Network for Single Image Dynamic/Defocus Deblurring.

GENERATIVE ADVERSARIAL NETWORKS FOR IMAGE SYNTHESIS AND STYLE TRANSFER IN VIDEOS

Generative Adversarial Networks (GANs) for Audio-Visual Speech Recognition in Artificial Intelligence IoT

Utilizing Generative Adversarial Networks for Stable Structure Generation in Angry Birds

Multivariate Emulation of Kilometer-Scale Numerical Weather Predictions with Generative Adversarial Networks: A Proof of Concept

Four-channel generative adversarial networks can predict the distribution of reef-associated fish in the South and East China Seas

Adaptive Filter in Single Image SRGAN

Prior-guided generative adversarial network for mammogram synthesis

A deep adversarial approach for the generation of synthetic titanium alloy microstructures with limited training data

Synthetic dual-energy CT reconstruction from single-energy CT Using artificial intelligence.

Deep neural architecture for natural language image synthesis for Tamil text using BASEGAN and hybrid super resolution GAN (HSRGAN)