The article examines different approaches to parallelizing the conjugate gradient algorithms for shared and distributed memory multiprocessor systems. We use the standards of MPI and OpenMP libraries of the C programming language for parallel programming. Testing the algorithms was carried out on the HP BLc3000 supercomputer, which contains 392 Intel cores and the InfiniBand network. The article presents the test results – acceleration and efficiency of parallel implementation for each type of multiprocessor systems depending on the dimension of a system of linear equations and the number of parallel processes. We present short descriptions of parallelization methods and excerpts of the code.
Read full abstract