Abstract
The work is devoted to developing and testing parallel algorithms, suit of computer programs for numerical solution of CFD-problems on modern supercomputers. The paper summarizes our experience in solving various practical problems of gas dynamics. These problems include research of gas flow around bodies of complex shape by viscous gas flows with radiation processes near the surface of the bodies, calculations of jet streams in the open space and flows in micro channels of technical systems. We have developed approaches combining unstructured grids, domain decomposition technique, parallel implementation via MPI technology, OpenMP, CUDA. Calculations are performed on the heterogeneous computer systems with Graphical Processing Units, classical microprocessors (Central Processing Units) and use advanced parallel approaches.
Highlights
At present time there is a rapid increase in productivity of computer and supercomputer hardware
Returning to Earth after a flight into space is a fundamental problem, that demands to take into account gas-dynamic flow processes and the effects associated with radiation
We propose method that integrates a macroscopic approach and a method of molecular dynamics (MMD) [21,22,23]
Summary
At present time there is a rapid increase in productivity of computer and supercomputer hardware. The applications must explicitly copy the data in memory VPU or GPU and back. At present the developments of special parallel approaches to the implementation of the calculations on heterogeneous systems with different topologies and architectures are relevant [2, 3]. In this paper the Computational Fluid Dynamics (CFD) tasks on supercomputers with hybrid architecture of computing nodes, including CPU, VPU or GPU solving parallel technology is presented. Technology suggests that the discrete mathematical model of the CFD-process uses unstructured mesh and can be strongly spatially inhomogeneous. In this case, optimization techniques for balance of the computational load per a single node or a calculator inside node are implemented. Algorithms and block methods of exchange among the different devices of random access memory are proposed
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.