MNIST Research Articles

In research on building a one-shot learning neural network without pre-training using mass data, the limitation on the information obtained from a single training sample downgrades the performance of the network. In order to improve performance and take full advantage of the support set, in this study, we design three kinds of shadow nodes and propose a structure-based training method for a correlation-coefficient-based neural network. This training strategy focuses on branches that are not activated or inactivated as expected. In contrast to existing networks that optimize the parameters using back-propagation, the training method proposed in this paper optimizes the structure of the correlation-coefficient-based network by correcting its pixel errors. For the shadow nodes and training process based on this strategy, the intersection over union (IOU) of a detected target increases by 4.83% in the experiments when using the Fashion-Mnist dataset, increases by 4.02% when using the Omniglot dataset, and increases by 3.89% when using the Cifar-10 dataset. The samples in category "7" wrongly classified as "1" decreased by 27.32% when using the Mnist dataset after training. This training strategy, along with shadow nodes, makes the correlation-coefficient-based network a more practical model and enables the network to develop during the accumulation of reliable samples, thus making it more suitable for simple target detection projects that collect samples over time. Moreover, the shadow nodes and training method proposed in this paper supplement the non-gradient-based parameter-gaining strategy. Additionally, it is a new attempt to explore the imitation of a human's ability to learn a new pattern from a low number of references.

Read full abstract

A scalable (<130 nm) resistive switching memristor that features both filamentary and interfacial switching aimed at neuromorphic computing is developed in this study. The typically perceived noise or volatility was effectively harnessed as a controlled mechanism for interfacial switching. The multilayer structure for the proposed memristor enhances switching stability by curbing ionic overmigration and mitigating leakage paths. Furthermore, the memristors showcased their reliability by demonstrating more than 15 M cycles in the filamentary mode and 1 M pulses in the interfacial mode. Additionally, retention tests at 85 °C for 104 s confirmed the stability across different states, affirming its reliability as a nonvolatile CMOS-compatible element. While many studies validate performance solely on the MNIST data set, this work also evaluates more complex data sets, demonstrating the robustness of the demonstrated memristor in supervised learning. Specifically, supervised learning simulations on MNIST and fashion MNIST data sets indicated a high learning rate with <4% deviations from numerical training, while offline inference trained on CIFAR-10 and CIFAR-100 data sets revealed <2.5% and <7% deviations caused by programing error accumulation, even with increased memristor counts for these highly complex data sets. Unsupervised learning via spike-timing-dependent plasticity further highlights the potential of the developed memristor in bridging artificial and biological paradigms, offering a significant advance toward efficient and biologically inspired computing architectures.

Read full abstract

MNIST Research Articles

Articles published on MNIST

Effective theory of collective deep learning

The backpropagation algorithm implemented on spiking neuromorphic hardware

Nitrogen-doped carbon quantum dot-decorated In2O3 synaptic transistors for neuromorphic computing

Permutation-equivariant quantum convolutional neural networks

Artificial photoelectric synaptic devices with ferroelectric diode effect for high-performance neuromorphic computing

Analysis of ensemble machine learning classification comparison on the skin cancer MNIST dataset

An Empirical Study of WGAN and WGAN-GP for Enhanced Image Generation

Leveraging multiplexed metasurfaces for multi-task learning with all-optical diffractive processors

Federated Learning's Dynamic Defense Against Byzantine Attacks: Integrating SIFT-Wavelet and Differential Privacy for Byzantine Grade Levels Detection

Integrated convolutional kernel based on two-dimensional photonic crystals.

Enhancing federated learning with dynamic weight adjustment based on particle swarm optimization

Image classification with deconvolution operation and augmentation

Phase Change Memory Drift Compensation in Spiking Neural Networks Using a Non-Linear Current Scaling Strategy

Ferrimagnet-Based Neuromorphic Device Mimicking the Ventral Visual Pathway for High-Accuracy Target Recognition.

Structure-Based Training: A Training Method Aimed at Pixel Errors for a Correlation-Coefficient-Based Neural Network.

A step function based recursion method for 0/1 deep neural networks

ZnO-based artificial synaptic diodes with zero-read voltage for neural network computing

Leveraging Tunability of Localized-Interfacial Memristors for Efficient Handling of Complex Neural Networks.

Pruning convolution neural networks using filter clustering based on normalized cross-correlation similarity

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

MNIST Research Articles

Articles published on MNIST

Effective theory of collective deep learning

The backpropagation algorithm implemented on spiking neuromorphic hardware

Nitrogen-doped carbon quantum dot-decorated In2O3 synaptic transistors for neuromorphic computing

Permutation-equivariant quantum convolutional neural networks

Artificial photoelectric synaptic devices with ferroelectric diode effect for high-performance neuromorphic computing

Analysis of ensemble machine learning classification comparison on the skin cancer MNIST dataset

An Empirical Study of WGAN and WGAN-GP for Enhanced Image Generation

Leveraging multiplexed metasurfaces for multi-task learning with all-optical diffractive processors

Federated Learning's Dynamic Defense Against Byzantine Attacks: Integrating SIFT-Wavelet and Differential Privacy for Byzantine Grade Levels Detection

Integrated convolutional kernel based on two-dimensional photonic crystals.

Enhancing federated learning with dynamic weight adjustment based on particle swarm optimization

Image classification with deconvolution operation and augmentation

Phase Change Memory Drift Compensation in Spiking Neural Networks Using a Non-Linear Current Scaling Strategy

Ferrimagnet-Based Neuromorphic Device Mimicking the Ventral Visual Pathway for High-Accuracy Target Recognition.

Structure-Based Training: A Training Method Aimed at Pixel Errors for a Correlation-Coefficient-Based Neural Network.

A step function based recursion method for 0/1 deep neural networks

ZnO-based artificial synaptic diodes with zero-read voltage for neural network computing

Leveraging Tunability of Localized-Interfacial Memristors for Efficient Handling of Complex Neural Networks.

Pruning convolution neural networks using filter clustering based on normalized cross-correlation similarity