Training Bottleneck Research Articles

Deep neural networks have played a crucial role in the field of deep learning, achieving significant success in practical applications. The architecture of neural networks is key to their performance. In the past few years, these architectures have been manually designed by experts with rich domain knowledge. Additionally, the optimal neural network architecture can vary depending on specific tasks and data distributions. Neural Architecture Search (NAS) is a class of techniques aimed at automatically searching for and designing neural network architectures according to the given tasks and data. Specifically, evolutionary-computation-based NAS methods are known for their strong global search capability and have aroused widespread interest in recent years. Although evolutionary-computation-based NAS has achieved success in a wide range of research and applications, it still faces bottlenecks in training and evaluating a large number of individuals during optimization. In this study, we first devise a multi-objective evolutionary NAS framework based on a weight-sharing supernet to improve the search efficiency of traditional evolutionary-computation-based NAS. This framework combines the population optimization characteristic of evolutionary algorithms with the weight-sharing ideas in one-shot models. We then design a bi-population MOEA/D algorithm based on the proposed framework to effectively solve the NAS problem. By constructing two sub-populations with different optimization objectives, the algorithm can effectively explore network architectures of various sizes in complex search spaces. An inter-population communication mechanism further enhances the algorithm’s exploratory capability, enabling it to find network architectures with uniform distribution and high diversity. Finally, we conduct performance comparison experiments on image classification datasets of different scales and complexities. Experimental results demonstrate the effectiveness of the proposed multi-objective evolutionary NAS framework and the practicality and transferability of the introduced bi-population MOEA/D-based NAS method compared to existing state-of-the-art NAS methods.

Read full abstract

Spiking neural network (SNN) is a brain-inspired model with more spatio-temporal information processing capacity and computational energy efficiency. However, with the increasing depth of SNNs, the memory problem caused by the weights of SNNs has gradually attracted attention. In this study, we propose an ultra-low latency adaptive local binary spiking neural network (ALBSNN) with accuracy loss estimators, which dynamically selects the network layers to be binarized to ensure a balance between quantization degree and classification accuracy by evaluating the error caused by the binarized weights during the network learning process. At the same time, to accelerate the training speed of the network, the global average pooling (GAP) layer is introduced to replace the fully connected layers by combining convolution and pooling. Finally, to further reduce the error caused by the binary weight, we propose binary weight optimization (BWO), which updates the overall weight by directly adjusting the binary weight. This method further reduces the loss of the network that reaches the training bottleneck. The combination of the above methods balances the network's quantization and recognition ability, enabling the network to maintain the recognition capability equivalent to the full precision network and reduce the storage space by more than 20%. So, SNNs can use a small number of time steps to obtain better recognition accuracy. In the extreme case of using only a one-time step, we still can achieve 93.39, 92.12, and 69.55% testing accuracy on three traditional static datasets, Fashion- MNIST, CIFAR-10, and CIFAR-100, respectively. At the same time, we evaluate our method on neuromorphic N-MNIST, CIFAR10-DVS, and IBM DVS128 Gesture datasets and achieve advanced accuracy in SNN with binary weights. Our network has greater advantages in terms of storage resources and training time.

Read full abstract

Training Bottleneck Research Articles

Related Topics

Articles published on Training Bottleneck

Efficient Training of Graph Neural Networks on Large Graphs

Multi-Objective Evolutionary Neural Architecture Search with Weight-Sharing Supernet

Eliminating Data Processing Bottlenecks in GNN Training over Large Graphs via Two-level Feature Compression

SIMPLE: Efficient Temporal Graph Neural Network Training at Scale with Dynamic Data Placement

Bilateral Supervision Network for Semi-Supervised Medical Image Segmentation.

Computation and Communication Efficient Federated Learning With Adaptive Model Pruning

Uncorking the bottleneck in anaesthesia training: novel approaches to a growing crisis

Overcoming training bottlenecks: mixed-methods evaluation of digital training for non-specialists in postnatal depression self-help treatment

ALBSNN: ultra-low latency adaptive local binary spiking neural network with accuracy loss estimator.

The growing bottlenecks in specialty training

Beyond Training the Next Generation of Physicians: The Unmeasured Value Added by Residents to Teaching Hospitals and Communities.

AFedAvg: communication-efficient federated learning aggregation with adaptive communication frequency and gradient sparse

DisCOV: Distributed COVID-19 Detection on X-Ray Images With Edge-Cloud Collaboration

A Bayesian Domain Adversarial Neural Network for Corn Yield Prediction

SmaQ: Smart Quantization for DNN Training by Exploiting Value Clustering

Efficient Communication Scheduling for Parameter Synchronization of DML in Data Center Networks

Empirical analysis of performance bottlenecks in graph neural network training and inference with GPUs

Understanding the drivers of bottlenecks in RANZCP training: modelling and a calculator to determining sustainable trainee intake.

On the Looming Physician Shortage and Strategic Expansion of Graduate Medical Education.

MOSS: End-to-End Dialog System Framework with Modular Supervision

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Training Bottleneck Research Articles

Related Topics

Articles published on Training Bottleneck

Efficient Training of Graph Neural Networks on Large Graphs

Multi-Objective Evolutionary Neural Architecture Search with Weight-Sharing Supernet

Eliminating Data Processing Bottlenecks in GNN Training over Large Graphs via Two-level Feature Compression

SIMPLE: Efficient Temporal Graph Neural Network Training at Scale with Dynamic Data Placement

Bilateral Supervision Network for Semi-Supervised Medical Image Segmentation.

Computation and Communication Efficient Federated Learning With Adaptive Model Pruning

Uncorking the bottleneck in anaesthesia training: novel approaches to a growing crisis

Overcoming training bottlenecks: mixed-methods evaluation of digital training for non-specialists in postnatal depression self-help treatment

ALBSNN: ultra-low latency adaptive local binary spiking neural network with accuracy loss estimator.

The growing bottlenecks in specialty training

Beyond Training the Next Generation of Physicians: The Unmeasured Value Added by Residents to Teaching Hospitals and Communities.

AFedAvg: communication-efficient federated learning aggregation with adaptive communication frequency and gradient sparse

DisCOV: Distributed COVID-19 Detection on X-Ray Images With Edge-Cloud Collaboration

A Bayesian Domain Adversarial Neural Network for Corn Yield Prediction

SmaQ: Smart Quantization for DNN Training by Exploiting Value Clustering

Efficient Communication Scheduling for Parameter Synchronization of DML in Data Center Networks

Empirical analysis of performance bottlenecks in graph neural network training and inference with GPUs

Understanding the drivers of bottlenecks in RANZCP training: modelling and a calculator to determining sustainable trainee intake.

On the Looming Physician Shortage and Strategic Expansion of Graduate Medical Education.

MOSS: End-to-End Dialog System Framework with Modular Supervision