This paper analyzes the optimization features of machine learning (ML) model training on multi-GPU systems to enhance cyber security in telecommunication networks. A key aspect of the study is data parallelism, which distributes the training load across multiple GPUs, significantly reducing training time and improving model accuracy, both critical factors for rapid threat detection in cyberspace. A novel approach to optimizing the data batch size using Mutual Information (MI) is proposed, which balances the utilization of computational resources against the information content of the training data: MI helps determine the batch size that minimizes training errors and improves model accuracy without a significant increase in training time. Experimental results demonstrate the substantial advantages of multi-GPU configurations over single-GPU setups, delivering faster training and higher model accuracy. In particular, MI-guided batch size tuning significantly outperforms traditional manual tuning, yielding higher validation accuracy and shorter training time. The study shows that the MI-based approach is an effective tool for optimizing ML model training in real-world scenarios where cyber security is critical. The proposed methods allow ML models to train faster and identify potential threats more accurately, making them particularly relevant for telecommunication networks, where a rapid response to new threats in real time is required. The adoption of modern computational technologies such as multi-GPU systems and MI-optimized training enhances the efficiency and accuracy of machine learning models, which in turn strengthens cyber security measures and ensures a more reliable defence of telecommunication networks against malicious attacks.
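The abstract does not give the MI criterion itself, but the idea of matching batch size to the information content of the data can be sketched as follows. This is a minimal, hypothetical illustration, not the paper's method: all function names (`discrete_mi`, `select_batch_size`), the discrete plug-in MI estimator, and the rule "pick the smallest batch size whose average per-batch MI is close to the full-data MI" are assumptions made for illustration.

```python
# Hypothetical sketch of MI-guided batch-size selection (illustrative only,
# not the paper's algorithm). Idea: choose the smallest batch size whose
# batches carry, on average, nearly as much label information as the full
# training set, so compute is not wasted on uninformatively large batches.
import numpy as np

def discrete_mi(x, y):
    """Plug-in mutual information estimate (in nats) for two discrete arrays."""
    joint = np.zeros((x.max() + 1, y.max() + 1))
    for xi, yi in zip(x, y):
        joint[xi, yi] += 1
    joint /= joint.sum()
    px = joint.sum(axis=1, keepdims=True)   # marginal of x
    py = joint.sum(axis=0, keepdims=True)   # marginal of y
    nz = joint > 0                          # avoid log(0)
    return float((joint[nz] * np.log(joint[nz] / (px @ py)[nz])).sum())

def select_batch_size(features, labels, candidates, tol=0.05, rng=None):
    """Return the smallest candidate batch size whose average per-batch MI
    is within relative tolerance `tol` of the full-data MI (assumed rule)."""
    rng = rng or np.random.default_rng(0)
    full_mi = discrete_mi(features, labels)
    for b in sorted(candidates):
        idx = rng.permutation(len(labels))
        batches = [idx[i:i + b] for i in range(0, len(idx) - b + 1, b)]
        avg_mi = np.mean([discrete_mi(features[ix], labels[ix]) for ix in batches])
        if full_mi == 0 or abs(full_mi - avg_mi) / full_mi <= tol:
            return b
    return max(candidates)

# Toy usage with a noisy discretized feature and binary benign/malicious labels.
rng = np.random.default_rng(42)
y = rng.integers(0, 2, size=2000)
x = (y + (rng.random(2000) < 0.25).astype(int)) % 2   # label flipped with p=0.25
print(select_batch_size(x, y, candidates=[32, 64, 128, 256, 512]))
```

Small batches tend to overestimate MI (finite-sample bias of the plug-in estimator), so the rule naturally rejects batch sizes too small to reflect the data's information content; in a real pipeline one would use a bias-corrected or continuous MI estimator rather than this discrete toy.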
It is noted that the proposed approaches can be adapted not only for cyber security but also for other areas where high model accuracy and fast training are important. Future research prospects include the development of new machine learning methods, particularly deep neural networks, the exploration of alternative computational architectures such as quantum computing or distributed systems, and their integration into real-time systems. Special attention should be paid to the ethical aspects of implementing automated cyber security systems, particularly in preventing bias in algorithms and ensuring fairness in their application.