Asynchronous Parallel Research Articles

Based on the volume-of-fluid multiphase flow model and the overset mesh technique, a numerical method was established for the asynchronous parallel oblique water entry of super-cavitating projectiles. Experiments on the oblique water entry of a single high-speed projectile were carried out to validate the numerical method. Numerical simulations were then used to analyze the cavity evolution and motion characteristics of the front and rear projectiles at different initial intervals and in two water-entry sequences: top-side projectile first and bottom-side projectile first. The results show that when the initial interval is 0.5 times the projectile length, the first-launched projectile cannot produce a cavity that completely envelops it, because it is violently squeezed by the cavity of the following projectile; its motion is seriously affected and it eventually loses trajectory stability. Moreover, a first-launched projectile that enters the water from the top side is squeezed more strongly than one that enters from the bottom side, so wetting occurs earlier and stability is lost faster. As the initial interval increases, the influence of the following projectile's cavity on the first-launched projectile weakens, and the first-launched projectile moves steadily in both water-entry sequences. The cavity of the following projectile is always asymmetrical because of the continuous influence of the first-launched projectile's cavity, and its motion stability is affected. The following projectile deflects toward the inner side and destabilizes when the initial interval is 0.5 times the projectile length, moves steadily when the interval is 1 times the projectile length, and deflects toward the outer side and destabilizes when the interval is 2 or 3 times the projectile length. In addition, the motion characteristics of the following projectile are essentially identical in the two water-entry sequences.

As engineering technology develops, engineering practice places higher demands on the accuracy and scale of simulation. Traditional serial programs are not computationally efficient enough to meet these demands. Reducing the run time of the temperature control simulation program is therefore of practical importance: it enables real-time simulation of the temperature and stress fields and, in turn, more reasonable temperature control and crack prevention measures. To address this, GPU parallel computing is introduced into the temperature control simulation program for mass concrete and then optimized. An improved indicator formula for analyzing GPU parallel algorithms is proposed that accounts for GPU clock rate, number of cores, parallel overhead, and the parallel region, making up for the shortcoming of traditional formulas that focus only on time. According to this formula, when there are enough threads, the parallel effect is limited by the size of the parallel region, and when the parallel region is large enough, the efficiency is limited by the parallel overhead and the clock rate. The paper also studies the optimal kernel execution configuration. Using shared memory improves memory access efficiency by 155%. After bank conflicts are resolved, a speedup of 437.5× is achieved in the solver's matrix transpose subroutine. CUDA streams are used to run data access and logical operations asynchronously in parallel on the GPU, which overlaps part of the data access time. On top of GPU parallelism, asynchronous parallelism can double the computing efficiency. Compared with the serial program, the GPU asynchronous parallel program achieves a speedup of 61.42× for inner product matrix multiplication. The study further proposes a theoretical formula for the data access overlap rate to guide the choice of the number of CUDA streams for optimal computing conditions. The GPU parallel program, compiled and optimized on the CUDA Fortran platform, effectively improves the computational efficiency of the concrete temperature control simulation program and better serves engineering computing.
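
The trade-off behind choosing a stream count can be illustrated with a toy timing model. The sketch below is not the overlap-rate formula from the abstract, which is not given here. It assumes an idealized two-stage pipeline in which the data is split into equal chunks, each chunk is copied and then processed, and the copy of one chunk may overlap the kernel of the previous one. The function names, the `overhead` parameter (a fixed per-chunk cost), and all numbers are hypothetical.

```python
def pipelined_time(copy_time, kernel_time, n_chunks, overhead=0.0):
    """Makespan of an idealized two-stage copy -> kernel pipeline.

    Assumptions (illustrative only): one copy engine, one compute engine,
    equal-sized chunks, and a fixed `overhead` added to every chunk copy.
    """
    if n_chunks < 1:
        raise ValueError("n_chunks must be at least 1")
    copy_step = copy_time / n_chunks + overhead
    kernel_step = kernel_time / n_chunks
    copy_done = 0.0    # time at which the latest chunk copy finishes
    kernel_done = 0.0  # time at which the latest chunk kernel finishes
    for _ in range(n_chunks):
        copy_done += copy_step
        # A kernel starts once its chunk has arrived and the previous kernel is done.
        kernel_done = max(copy_done, kernel_done) + kernel_step
    return kernel_done


def best_chunk_count(copy_time, kernel_time, max_chunks, overhead=0.0):
    """Chunk count in 1..max_chunks with the smallest modeled makespan."""
    return min(
        range(1, max_chunks + 1),
        key=lambda n: pipelined_time(copy_time, kernel_time, n, overhead),
    )


if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        print(n, pipelined_time(4.0, 4.0, n, overhead=0.25))
    print("best:", best_chunk_count(4.0, 4.0, 16, overhead=0.25))
```

In this model, more chunks hide more of the copy time behind computation, but each extra chunk also pays the fixed overhead, so the modeled run time has a minimum at a finite chunk count. Real stream behavior depends on the hardware and on how transfers and kernels are issued.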

Articles published on Asynchronous Parallel

Application of discrete random forest algorithm in multi-person asynchronous parallel disassembly sequence planning for hydropower station equipment maintenance

An Energy-Efficient Bayesian Neural Network Implementation Using Stochastic Computing Method.

Parallel PSO for Efficient Neural Network Training Using GPGPU and Apache Spark in Edge Computing Sets

Replica Exchange of Expanded Ensembles: A Generalized Ensemble Approach with Enhanced Flexibility and Parallelizability.

Client selection and resource scheduling in reliable federated learning for UAV-assisted vehicular networks

The application of chessboard game based on integrated learning and UCT algorithm in mental health and emotional regulation

Parallel and Distributed Graph Neural Networks: An In-Depth Concurrency Analysis.

Imperative Process Algebra and Models of Parallel Computation

Analyzing cavity evolution and motion characteristics of asynchronous parallel oblique water-entry super-cavitating projectile

HASP: Hierarchical Asynchronous Parallelism for Multi-NN Tasks

Asynchronous Parallel Fuzzy Stochastic Gradient Descent for High-Dimensional Incomplete Data Representation

An Asynchronous Parallel I/O Framework for Mass Conservation Ocean Model

APapo: An asynchronous parallel optimization method for DNN models

A parallel strategy to accelerate neighborhood operation for raster data coordinating CPU and GPU

An asynchronous parallel benders decomposition method for stochastic network design problems

Research on the Application and Performance Optimization of GPU Parallel Computing in Concrete Temperature Control Simulation

Asynchronous Parallel Incremental Block-Coordinate Descent for Decentralized Machine Learning

Efficient multi-objective optimization through parallel surrogate-assisted local search with tabu mechanism and asynchronous option

A hybrid parallelization approach based on workers grouping algorithm

Correction: An asynchronous parallel high-throughput model calibration framework for crystal plasticity finite element constitutive models
