Parallel Domain Research Articles

With the development of engineering technology, engineering has higher requirements for the accuracy and the scale of simulation calculation. The computational efficiency of traditional serial programs cannot meet the requirements of engineering. Therefore, reducing the calculation time of the temperature control simulation program has important engineering significance for real-time simulation of temperature field and stress field, and then adopting more reasonable temperature control and crack prevention measures. GPU parallel computing is introduced into the temperature control simulation program of massive concrete to solve this problem and the optimization is carried out. Considering factors such as GPU clock rate, number of cores, parallel overhead and Parallel Region, the improved GPU parallel algorithm analysis indicator formula is proposed. It makes up for the shortcomings of traditional formulas that focus only on time. According to this formula, when there are enough threads, the parallel effect is limited by the size of the parallel domain, and when the parallel domain is large enough, the efficiency is limited by the parallel overhead and the clock rate. This paper studies the optimal Kernel execution configuration. Shared memory is utilized to improve memory access efficiency by 155%. After solving the problem of bank conflicts, an accelerate rate of 437.5× was realized in the subroutine of the matrix transpose of the solver. The asynchronous parallel of data access and logical operation is realized on GPU by using CUDA Stream, which can overlap part of the data access time. On the basis of GPU parallelism, asynchronous parallelism can double the computing efficiency. Compared with the serial program, the accelerate rate of inner product matrix multiplication of the GPU asynchronous parallel program is 61.42×. This study further proposed a theoretical formula of data access overlap rate to guide the selection of the number of CUDA streams to achieve the optimal computing conditions. The GPU parallel program compiled and optimized by the CUDA Fortran platform can effectively improve the computational efficiency of the simulation program for concrete temperature control, and better serve engineering computing.

Read full abstract

Aerobraking is a process used to slow down and insert a spacecraft into a low orbit around a planet. It is composed of many orbital passages into the complex atmosphere of the planet, which is used for braking. The aerobraking atmospheric passages are challenging because of the high variability of the atmospheric environment. For this reason, autonomous aerobraking planning is essential for safety and mission performance. This paper develops a parallel domain randomized deep reinforcement learning architecture for autonomous decision-making in a stochastic environment, such as aerobraking atmospheric passages. In this context, the architecture is used for planning aerobraking maneuvers to avoid the occurrence of thermal violations during the atmospheric aerobraking passages and target a final low-altitude orbit. The parallel domain randomized deep reinforcement learning architecture is designed to account for large variability of the physical model, as well as uncertain conditions. Also, the parallel approach speeds up the training process for simulation-based applications, and the domain randomization improves resultant policy generalization. To use this architecture, a Markov-Decision process framework is developed for a general aerobraking-type mission. A three-dimensional running reward function, expressed in spacecraft state and action, is designed. This framework is applied to the 2001 Mars Odyssey aerobraking campaign, which is also used to verify the performance of the parallel domain randomized deep reinforcement learning architecture. With respect to the 2001 Mars Odyssey mission flight data and a Numerical Predictor Corrector (NPC)-based state-of-the-art heuristic for autonomous aerobraking, the proposed architecture outperforms the state-of-the-art heuristic algorithm with an average increase of 87.2 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> in the cumulative reward and a decrease of 97.5 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> in the number of thermal violations. Specifically, the proposed architecture is able to predict and avoid thermal violations while requiring fewer computational resources. Furthermore, it yields a decrease of 98.7 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> in the number of thermal violations with respect to the Mars Odyssey mission flight data and requires 13.9 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> fewer orbits, with a comparable aerobraking duration and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$\Delta$</tex-math></inline-formula> V budget. Results also show that the proposed architecture can also learn a generalized policy in the presence of strong uncertainties, such as aggressive atmospheric density perturbations, different atmospheric density models, and a different simulator maximum step size and error accuracy. To this end, a generalization analysis is performed using out-of-distribution generalization environments. Results of the generalization analysis show that the architecture can perform safe aerobraking campaigns with only a maximum increase of 2 in the number of thermal violations for cases in which the simulator accurately described the physical model.

Read full abstract

Parallel Domain Research Articles

Related Topics

Articles published on Parallel Domain

Numerical study of turbulent eddy self-interaction in tokamaks with low magnetic shear. Part I: Linear simulations

High-throughput affinity measurements of direct interactions between activation domains and co-activators.

Probing intrinsic magnetic domain in bare CrI3 bulk with a magnetic force microscope

Domain composition and attention network trained with synthesized unlabeled images for generalizable medical image segmentation

Domain decomposition methods and acceleration techniques for the phase field fracture staggered solver

Variability in adolescent reception of parental support: Testing the domain-matching hypothesis.

An investigation into the reduction mechanism of temperature-magnetic stress relief based on DO3 crystal

A computational framework for pharmaco‐mechanical interactions in arterial walls using parallel monolithic domain decomposition methods

Enhanced insulation and ferro/piezo-electric properties of BiFeO3–BaTiO3via synchronous B-site co-doping and quenching treatment

A Parallel Domain Decomposition Method for the Fully-Mixed Stokes-Dual-Permeability Fluid Flow Model with Beavers-Joseph Interface Conditions

RUL prediction of rolling bearings across working conditions based on multi-scale convolutional parallel memory domain adaptation network

Research on the Application and Performance Optimization of GPU Parallel Computing in Concrete Temperature Control Simulation

A parallel domain decomposition method for identifying the space-time dependent diffusion coefficients of 3D parabolic problems

Temperature-drift effect analysis of microstrip filters based on parallel high-order DGTD and FETD method with memory reduction technique

Nuclear and chromatin rearrangement associate to epigenome and gene expression changes in a model of in vitro adipogenesis and hypertrophy

Influence of the grain boundary phase characteristics on the magnetic properties of Nd-Fe-B magnets

Autonomous Decision-Making for Aerobraking via Parallel Randomized Deep Reinforcement Learning

Massively parallel numerical simulation of 3D shock wave propagation based on JASMIN framework

Anterior and posterior imaging with hyperparallel OCT.

A Dimension-Oblivious Domain Decomposition Method Based on Space-Filling Curves

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Parallel Domain Research Articles

Related Topics

Articles published on Parallel Domain

Numerical study of turbulent eddy self-interaction in tokamaks with low magnetic shear. Part I: Linear simulations

High-throughput affinity measurements of direct interactions between activation domains and co-activators.

Probing intrinsic magnetic domain in bare CrI3 bulk with a magnetic force microscope

Domain composition and attention network trained with synthesized unlabeled images for generalizable medical image segmentation

Domain decomposition methods and acceleration techniques for the phase field fracture staggered solver

Variability in adolescent reception of parental support: Testing the domain-matching hypothesis.

An investigation into the reduction mechanism of temperature-magnetic stress relief based on DO3 crystal

A computational framework for pharmaco‐mechanical interactions in arterial walls using parallel monolithic domain decomposition methods

Enhanced insulation and ferro/piezo-electric properties of BiFeO3–BaTiO3via synchronous B-site co-doping and quenching treatment

A Parallel Domain Decomposition Method for the Fully-Mixed Stokes-Dual-Permeability Fluid Flow Model with Beavers-Joseph Interface Conditions

RUL prediction of rolling bearings across working conditions based on multi-scale convolutional parallel memory domain adaptation network

Research on the Application and Performance Optimization of GPU Parallel Computing in Concrete Temperature Control Simulation

A parallel domain decomposition method for identifying the space-time dependent diffusion coefficients of 3D parabolic problems

Temperature-drift effect analysis of microstrip filters based on parallel high-order DGTD and FETD method with memory reduction technique

Nuclear and chromatin rearrangement associate to epigenome and gene expression changes in a model of in vitro adipogenesis and hypertrophy

Influence of the grain boundary phase characteristics on the magnetic properties of Nd-Fe-B magnets

Autonomous Decision-Making for Aerobraking via Parallel Randomized Deep Reinforcement Learning

Massively parallel numerical simulation of 3D shock wave propagation based on JASMIN framework

Anterior and posterior imaging with hyperparallel OCT.

A Dimension-Oblivious Domain Decomposition Method Based on Space-Filling Curves