Abstract
As multi-core CPUs are widely used in SMP clusters, parallel programming should pay more attention on shared memory parallelization inside single node. Hybrid MPI+OpenMP programming is naturally a good model that combines the distributed memory parallelization between nodes in clusters and the shared memory parallelization on each node. In this paper, we propose a parallel MLMFA (multilevel fast multipole algorithm) approach based on hybrid MPI+OpenMP model. Performance of hybrid implementation is studied compared with our previous pure MPI version. Time cost for computation and communication and memory consumption are analyzed in detail. As most modern HPC systems are clusters of SMP, the implementation is relevant.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.